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ST55^555F$$T TTG S TCCCAAA ^ AAGACAAGAGAT CCTTGATCTGTGGATCTACCACAC 
G A CCTTC CCGATTA A ACCAGGGTTTCTTCTGTTCTCTAGG A ACT AGA C A CCTAGATGGTGTG 

26 mboll, 50 bini, 

$^55^T$ST T ^9 TGATTGGCAGAATTACACACCA GGGCCAGGGATCAGATATCCACT 
TGT T CCG ATGAAGGG ACTAACCGTCTT AATGTGTGGT CCCGGT C CCTAGTCT AT AGGTG A 

107 bini, 113 ecorS , 

GACCTTTGGATGGTGCTTCAAGCTAGTACCAGTTGAGCCAGAGAAGGTAGAAGAGGCCAA 
CTGGAAA CCTAC CA CG AAGTTCGAT CATGGTC AACTCGGT CTCTT CCATCTT CT 

172 mboll, 
IS£££§£ GAGAA £ AA S A ^^ 

ACTTCCTCTCTTGTTGT CGAACAATGTGGGATACTCGGACGTACCCTACCTCCTGCGCCT 

r$$$^$5T5 TT $5 T 5 TGGAGGTTT6ACAGCAAACTAGCA TTTCATCACATGGCCCGAGA 
CTTTCTTCACAATCACACCTCCAAACTGTCGTTTGATCGTAAAGTAGTGTACCGGGCTCT 

296 aval, 

CGACGTAGGCCTCATGATGTTTCTGACGACTGT AGCT CGA AAGATGTT CCCTGAAAGGCG 
31 4 seal , 

ACCCCTGAAAGGTCCCTCCGCACCGGACCCGCCCTGACCCCTCAC CGCAGGGAGTCTACG 

ACGT ATATT CGTCTGACGAAAAACGGACATGACCCAG AGAGACCAATCTGGT CTAGACTC 
474 bglll, 

rrIrrrTrrIrI?I5^l00^ T $ GGGAACCCAC 

GGACCCTCGAGAGACCGATTGATCCCTTGGGTGACGAATTCGGAGTTATTTCGAACGGAA. 
488 sacl, 518 aflll, 532 hindlll, 

CTCACGAAGTTCATC ACACACGGGCAGACAACACACTGAGACCATTGATCTCTAGGGAGT 

CTGGGAAAATCAGTCACACCTTTTTAGAGATCGTCACCGCGGGCTTGTCCCTGCGCTTTC 
639 narl , 

GCTTTCATCTTGGTCTCCTCGAGAGAGCTGCGTCCTGAGCCGAACGACTTCGCGCGTGTC 
680 sacl , "1 , 

GTTr t55? 555S9SS5 g ^ ga ^^ gg ^ gag ^ a ^^^^ aa tttttgacta^cggaggctagaa^ 

GTTCTCCGCTCCCCGCCGCTGACCACTCATGCGGTTAAAAACTGATCGCCTCCGATCTTC 
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f?^^F95 1 ytS uGluAr 9 PheAlaValAsnPr °Slyl-euLeuGluThrSerGluGlvCv 
^^5^^9T$5^£53 TTCGCAGTCAATCCTG6 tCTGTTAGAAACATCAGAAGGCTG 
TCGTCCCTCGATCTTGCTAAGCGTCAGTTAGGACCGGACAATCTTTGTAGTCTTCCGAC 



s 
C 
CG 



959 pstl 




'sAspThr 



1Q02 binl, 1008 mboll, 

IItI^IttIt^5I^S$$SSS T F ATTGTGTACATCAAAGGATAGA TGTAA^AGACACC 
AATATATTATGTCATCGTTGGGAGATAACACATGTAGTTTCCTATCTACATTTTCTGTGG 

A)f?riA A i?!??y5iiit^5 lleGluGluGluGlnAsnL Y s SerLysLysLysAlaGlnGln 

aaggaag ctttagagaAgatagaggaagagcaaaacaAaagtaA^ 

TTCCTTCGAAATCTCTTCTATCTCCTTCTCGTTTTGTTTTCATTCTTTTTCCGTGTCGTT 
1087 hindlll, 1097 mboll, 1107 nboll, p25 

?iA^i^i?^lf^USIX ThrG1 y AsnSerSerGlnVal SerGlnAsnTyrProIleVal 
§?^ G ^ AG ^ J G ^ AG CJGGtACAGGAAACAGCAGCCAGGTCAGCCAAAATTACCCTATAGTG 
CGTCGTCGACGTCGACCGTGTCCTTTGTCGTCGGTCCAGTCGGTTTTAATGGGATATCAC 

1147 pvull, 11S0 pstl, 1153 pvull, 1156 tthllll, 

rI? A f ?r?ArI?^X51l?!??^?iy i 55 lnAlalleSerProAr 9 T hPLeuAsnAlaTrp 
C AGAA CCTACAGGG5CAAATGGTA 

GTCTTGGATGTCCCCGTTT ACCATGTAGTCCGGTATAGTGGATCTTGAAATTTACGTACC 
1250 ahalll, 1255 ava3 , 

rTl!sX!r?Ir?ili^i , f^^U£ heSerProGluValI1 eProMetPheSerAlaLeu 

g I aaaag I ag J agaa gaaa£ggctttca^ 

CATTTTCATCATCTTCTTTTCCGAAAGTCGGGTCTTCATTATGGGTACAAAAGTCGTAAT 
1275 mboll. 



ThrValGlyGlyHisGln 
CTAAATTTGTGGTAC S AtfTG A i AG J GG ^ 6 ^ A J G CAA 



??AriAr^r^I^S?:?5 lnAs e LeuAsnThrMetLeuA sn 
JCAGAAGGAGCCACCCCACAAGAtTTAAACACCATGCTAAAC 
AGTCTTCCTCGGTGGGGTGTTCTAAATTTGTGGTACGATTTG 

1346 ahalll, 

?rfrr?A?^lA»?^?f , :^ s f ^Thrl leAsnGluGluAlaAlaGluTrpAspArgVal 
5S^5£?^T5? AAA J G J J AAA AGAGACTAT^ 

CGTCGGTACGTTTACAATTTTCTCTGATAGTTACTCCTTCGACGTCTTACCCTATCTCAC 
1423 pstl, 

riT??2rT^rl?^i?5iX^5H5 AlaProG1 i: GlnMetAr g 61up t , o A r9eiySerAsp 

fiTlrrTrI^$I§S$^SSH$T T S£d CCAGS CCAAATGAGAGAACCAAGGGGAAGT^ 

GTAGGTCACGTACGTCCCGGATAACGT6GTCCGGTTTACTCTCTTGGTTCCCCTTCACTG 

1451 sphl, 

H?^if G iY T ^£ Thr SerThrLeuGlnGluGlnIleGlyTrpMetThrAsnAsnProPro 
TATCGTCCTTGAtSI^^^ 

I ATCGTCCTT6ATGATCATGGGAA6TCCTTGTTTATCCTACCTACTGTTTATTAGGTGGA 

ATr???X?Ir^?l^iiSly5 L y sAr 9 Tp E IleIlaLeuSl YLeuAsnLysIleValArg 
tI£££65 TAGGasaa ATCTATAAAAGATGGATAATCCTGGGATTAAATAAAATAGTAAs2 

tagggtcatcctctttagatattttctacctattaggaccctUtttattW 

ATrTX???^^?T!l 1 E?S5: IleLeuAs B Ilo ArgGlnGlyProLy8GluProPhBApaAso 
TACATl A rr^rI^? A ^$IISI5S$?a TAAS * CAAGSACCA 

TACATATCGGGAT6GTCGTAAGACCTGTATTCTGTTCCT6GTTTCCTTGGGAAATTCTCTA 
1636 bstXI, ! 

I??^?i^?2 Ar 9 PheT y rL Y sTh «"LeuArgAlaGluGlnAlaSorGlnAsoValLvsAsn 
ATACATCTGGCCAAGATATTTTr^ 

« i «LATCTGGCCAAGATATTTTGAGATTCTC6GCTTGTTCGAAGTGTCCTACATTTTTTA 
1720 hindlll, 
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I?PA?H!2^5 lu ^ hrLeuLeuVal61nABnA laAsnProAspCysLysThrIleLeuLvs 

ACCTACT6TCTTT6GAACAACCAGGTTTTAC6TTTGGGTCTAACATTCT6ATAAAATTTT 
1796 ahalll, 

A )L?!=f" G iyProAlaAlaThrLeuGluGluMetMetThrAlaCysGlhGlyV 

f^SIIS^cSS^SSfCTACACTAGAAGAAATGATGACAG^ 

CGTAACCCTGGTCGTCGATGTGATCTTCTTTACTACTGTCGTACAGTCCCTCACCCCCCT 
1827 inboll, 

^° G iX"i5'r ys ^ laAr 9 ValLeuAlaG lu A laMetSerGlnValThrAsnProAlaAsn 
CCCGGC C AT AAAG C AAGAGTTTTGG CTGAAGCCATGAGCC AAGTA ACAAATCC ARTTA a? 
GGGCCGGT^TTTCGTTCTCAAAACCGACTTCGGTACTCGGTTCAT^ 

it'Me 
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H?A?^*?rSl? A 5!?5i.X AsnPheAr 9 AsnGlnA rgLysThrValLysCysPheAsnCvs 

6T^T ga T g £ agagagg c aa ttttagGaaccaaagAaagactgttaA^ 



TATTACTAC6TCTCTCCGTTAAAATCCTTGGTTTCTTTCTGACAATTM 




2014 apal, 2019 avr2, 
T£TGrA"A , ??^Apr*'ri^A?£?S^ 

iii^$$55s5$ss5£5 ccaaatgaaaga ' t6 cactgagagacaggctaattttttaggg 
acaccttcccttcctgtggtttactttctaacgtgactctctgtccgattaaaaaatccc 

2102 inboll , 

A?rii?I?Prr???^I?5: , rY55Iy Ar 9 ProG1 i: AsnPheL euGlnSerArQProGluPro 

TTrTl^Ir^SIISST$^ AGGGAAGGCCAGGGAA TTTTCTTCAGAGCAGACCAGAGC^ 

TTCTAGACCGGAAGGATGTTCCCTTCCGGTCCCTTAAAAGAAGTCTCGTCTGGTCTCGGT 

2104 bglll, 2141 mboll, 

T^»^i?^?^°§ luGluSerPh eArqPheGlyGluGluLysThrThrProSerGlnLv«; 

TGTCG6GGTGGTCTTCTCTCGAAGTCCAAACCCCTCCTCTTTTGTTGAGGGAGAGTCTTC 
2175 mboll, 

rlPrl^rS^U?*-? BfrysCluLeuTyrProLeuThrSerLeuArgSerLeuPheGlvAsn 

caggagccgatagacaaggaactgtatcctttaacttccctcagatcactrtttrk^al? 
gtcctcggctatctgttccttgacataggaaattgaAgggagtcta^ 

AspProSerSerGlnOC ) 

GA ^CCTCGTCACAATAAGGATAGGGGGGCAACTAAAGGAAGCTCTATTAGATACAGGA 
CTGGGGAGCAGTGTTATTCCTATCCCCCCGTTGATTTCCTTCGA^TAATCTATGT^ 



931154 



CGTCTACTATGTCATAATCTTCTTTACTTAAACGGTCCTTTTACCTTTGGT^ 
2360 mboll, 2375 bstXI, 

LLCCCTTAACCTCCAAAATAGTTTCATTCTGTCATGCTAGTCTATGGACATCTTTAGACA 

GGXcATAXASCTATArr?Ir actItta^t!^ lelleGlyArg 

CCTGTATTTCGATATCCATGTCATAATCATCCTGGATGTGGACAGTTGTATTAACCTTCT 
2517 mboll, 

AATCTGT?rIr??ir Ii?f £*$ y -?T£S!;S uAsnPhePro1 1 »SerProI leGluThrVa 1 
TTArI§IIS$SISTS5T GG ' ^ G ^ A ^TTTAAATTTCCCCATTAGTCCTATTGAAACTGTA 
TTAGACAACTGAGTCTAACCAACATGAAATTTAAAGGGGTAATCAGGATAACTTTGAC^ 

2548 ahalll, 2S77 tthllll, 

GGTCATTTTAATTTCGGTCCTTACCTACCGGGTTTTCAATTCGTTACCGGTAACTG^ 
2627 ball, 2639 mboll, 

ilAA??U?^? A l a !;5 u y alGluIleC i: aTh rGluMetGluLysGluGlvLvsIleSer 

CTTT$$ A I$$$ A S^IT A 5 TA6AGATATGTA C A G A AATGGAAAAGGAA^ 

CTTTTTTATTTTCGTAATCATCTCTATACATGTCTTTACCTTTTCCTTCCCTTTTAAAG^ 



'I 





2702 



Aa'aATTSG^CCtImAATCCATA^ 

i i AACCCeGACTTTTAGGTAT6TTATGA6GTCATAAACGATATTTCTTTTTTCTGTCA 
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TGATTTACCTCTTT TGATCATCTAAAGTCTCTTGAATT 



2895 ava3, 
2985 ecorS, 

3003 tthllll, 3006 bstXI, 3021 binl, 

mmmmmmmimmmi 

3126 binl, 3171 tthllll, 

3308 pvull f 
34A7 mboll, 
3594 ahalll, 



Glu 
GAG 



CCGGTfACCtGTAfAGfTTAAATAGTTCTCGGTAA^ 
3659 hpal , 
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ThrGluAlaValGlflLysValSe.. . tirG luSerl 1 eVa 111 eTrpGl yLysI \ ^ r'roLvs 
•ACAGAGGCAGTGCAAAAAGTATCCACAGAAAGCATAGTAATATGGGGAAAGATTCCTAAA 
TGTCTCCGTCACGTTTTTCATAG6TGTCTTTCGTATCATTATACCCCTTTCTAAGGATTT 

PheLysLeuProIleGlnLysGluThrTrpGluAlaTrpTrpMetGluTyrTrpGlnAla 
TTTAAACTACCCATACAAAAGGAAACATGGGAA6CATGGTGGATGGAGTATTGGCAAGCT 
AAATTTGATGGGTATGTTTTCCTTTGTACCCTTCGTACCACCTACCTCATAACCGTTCGA 

3723 ahalll, 

ThrTrpI 1 eProGl uTrpG 1 u PheVa 1 AsnThrProProLeuVa I LysLeuTrpTyrGl n 
ACCTGGATTCCTGAGTGGGAGTTTGTCAATACCCCTCCCTTAGTGAAATTATGGTACCAG 
TGGACCTAAGGACTCACCCTCAAACAGTTATGGGGAGGGAATCACTTTAATACCATGGTC 

3835 kpnl, 

LeuGluLysGluProIleValGlyAlaGluThrPheTyrValAspGlyAlaAlaAsnArq 
TTAGAGAAAGAACCCATAGTAGGAGCAGAAACTTTCTATGTAGATGGGGCAGCTAATAGG 
AATCTCTTTCTTGGGTATCATCCTCGTCTTTGAAAGATACATCTA CCCCGTCGATTATCC 

GluThrLysLeuGlyLysAlaGlyTyrValThrAspArgGlyArgGlnLysValValSer 
56^S T 656 TTAG6 ^ A ^ AGCAGG * T ^ TGTTACTGA CAGAGGAAGACAAAAAGTTGTCTCC 
CTCTGATTTAATCCTTTTCGT CCTATACAATGACTGTCTCCTTCTGTTTTTCAA.CAGAGG 

3943 mboll, 

H?^i5^?2T!2^T , 2 rAsnGlnl -Y sThrGlul - euS l nA laIleHisLeuAlaLeuGlnAsp 
ATAGCTGACACAACAAATCAGAAGACTGAATTACAAGCAATTCATCTAGCTTTGCAGGAt 
TATCGACTGTGTTGTTTAGTCTTCTGACTTAATGTTCGTTAAGTAGATCGAAACGTCCTA 

3983 mboll, 

SerGlvLeuGluValAsnneValThrAspSerGlnTyrAlaLe^ 

IS g 5 Ga TT agaagtaaacat agtaacagactcacaatatgcattaggaatcattcaagca 
agccctaatcttcatttgtatcattgtctgagtgttatacgtaatccttagtaagttcgt 

4060 ava3, 

S l"^? A ?ELysSerGluSerGluLeuValSerGlnIleIleGluGlnLeuIleLysLys 

caaccagaYaagagtgaatcagag^ 

gttggtctattctcacttagtctcaatcagtcagtttattatctcgtcaattattttttc 

GAAAAGG ICTACCTGGCATGGGTAC^ 

CTTTTCCAGATGGACCGTACCCATGGTCGTGTGTTTCCTTAACCTCCTTTACTTGTTCAT 
4163 kpnl . j 

rI?Ant?*r?^???^^^U? A 5l3 L y sValLeilPheLeuAsnG1 y IleAs ELysAla 
G ^I A ^ A JI AG IS AG I G CTGGAATCAG I GAAAGTA^^^ 

CTATTTAATCAGTCACGACCTTAGTCCTTTCATGATAAAAACTTACCTTATCTATTCCGG 
4232 seal, 

CAAGAAGAACATGASAAATATCA^ 

GTTCTTCTTGTACTCTTTATAGTGTCATTAACCTCTCGTTACCGATCACTAAAATTGGAC 
4266 mboll, 

?5?S5S^?iy5i^l aL Y sSlulleValAlaS erCySAspLysCysGlnLeuLysGly61u 
£?^9^ J G J AG JAGCAAAAGAAATA6TAGCCAGCTGTGAT^ 

GGTGGACATCATC6TTTTCTTTATCATCGGTCGACACTATTTACAGTCGATTTTCCTCTT 
4352 pvull, 

rrrAT^i??^5i^?!^?8$X5 S 5E ProG1 Y lleTr e 61nl - eilAs E c X sT hrHisLeu 
rrrTl^$I^?S$$SI$^STST55 T S CAGGAATAT ^ 

CGGTACGTACCT6TTCATCT6ACATCAGGTCCTTATACCGTTGATCTAACATGTGTAGAT 
4386 ava3, 4410 bstXI, 4439 xbal. 

CTTCCTTTTTAATAG6ACCATCGTCAAGTACATCGGTCACCTATATATCTTC6TCTTCAA 
4497 Kmnl f 

TAAGGTC6TCTCTGTCCCGTCCTTTGTCGTATAAAAGAGAATTTTAATCGTCCTTCTACC 
4SS5 mboll f 4560 ball,: 
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ProValLysThrlleUTsThrAspAsnGlySerAsnPheThrSerThrThrValLysAla i 
CC AGTAAAAACAATACATAC AGACAATGGCAGCAATTTCACCAGTACT ACGGTTAAGGCC 
GGTCATTTTTGTTATGTATGTCTGTTACCGTCGTTAAAGTGGTCATGATGCCAATTCCGG j 

4605 seal , j 

AlaCXsIrpTrpAlaGlylleLysGinGluPheGlylleProTyrAsnProGlnSepGln ! 
§£ST2TT§S T §2 GCAGG « ATCAA6CA6GAATT TGGCATTCCCTACAATCCCCAAAGTCAA ! 
CGGACAACCACCCGTCCCTAGTTCGTCCTTAAACCGTAA6GGAT6TTAGGGGTTTCAGTT 

4639 binl, 

SlyValValGluSepMetAsnAsnGluLeuLysLysllelleGlyGlnValArqAspGln 
GGAGTAGTAGAATCTATGAATAATGAATTAAAGAAAATTATAGGACAGGTAAGAGATCAG 
CCTCATCATCTTA6ATACTTATTACTTAATTTCTTTTAATATCCTGTCCATTCTCTAGTC 

AlaGluHisLeuLysThrAlaValGHnMetAlaValPhelleHisAsnPheLysArgLys 
G £ TGAA C A CCTTAAGACAGCAGTACAAATGGCA6TATTCATCCAeAATTTTAAAAGAAAA 
CGACTTGTGGAATTCTGTCGTCATGTTTACCGTCATAAGTAGGTGTTAAAATTTTCTTTT 



4752 aflll, 4791 ahalll, 



Sl)i G l):U5 G iX G iYTyrSerAlaGlyGluArgIleValAspIle 

5 GGGG s A nS5S5 GA T6S65 T6CAGGGGAAAGAAT 

CCCCCCTAACCCCCTATGTCACGTCCCCTTTCTTATCATCTGTATTATCGTTGTCTGTAT 
GlnThrLysGluLeuGlnLysGlnl leThrLysIleGlnAsnPheArgValTyrTyrArg 

caaactaAagaactacaaaagcaaatta 

gtttgatttcttgatgttttc6tttaatgtttttaagttttaaaagcccaaataatgtcc 

BES2!=S"lEBLYsGlyProAlaLYsLeuLeuTrpLysGlyGluGlyAla 
GACAACAAAGATCCCCTTTGGAAAGGACCAGCAAAGCTTCTCTGG^ 

CTGTTGTTTCTAGGGGAAACCTTTCCTGGTCGTTTCGAAGAGACCTTTCCACTTCCCCGT 
4954 hindlll, 

y?ir?lU?ll^!e A f5f??$fB I i eL ^ sValValProAr 9 Ar 9 L y sAla, -ysIleIle 

CATCATTATGTTCTATTATCACT6TATTTTCATCACGGTTCTTCTTTTCGTTTTTAGTAA 
5023 raboll, 

a a T Me ^ Gl " A snArgTrpGlnValMetlleValTrpGlnValAspArgMetArgIle 
A ?g A ffiTyrGlYLysGlnfletAlaGlyAspA 

TCCCTAATACCTTTTGTCTACCGTCCACTACTAACACACCGTTCATCTGTCCTACTCCTA 

^ArgTreTrpLysSerLeuValLysHisHisMetTyrlleSerLysLysAlaLysGlyTrp 

T AGAA C A TGGAAAAGTTTAGTAAAACACCATATGTATATTTCAAAGAAAGCTAAAGGATGG 
ATCTTGTACCTTTTCAAATCATTTTGTGGTATACATATAAAGTTTCTTTCGATTTCCTACC 

5131 ndel, 

?T?^? A ??yi?tJf5l^5§l uSerThrHisProAr g ValSerSerS l uV alHisIle 

IIIItI$5?^IS5ST$T55S AGTACTCATCCAAGaGtaagttcagaag tacacatc 
aaaatatctgtagtgatactttcatgagtaggttctcattcaagtcttcatgtgtag 

5185 seal, 

ES?^? 1 * 6 . iY A fE A l2^ s tSy VallleThrThrT Y rTr B sl yLBuHisThpGly6luArg 
"CCTAGG6GATGCTAAATT6GTAATAACAACATATTGGGGU 

GGGGATCCCCTACGATTTAACCATTATTGTTGTATAACCCCAGACGTATGTCCTCTTTCT 
5223 avr2, 

f luTrpHisLeuGlyGlnGlyValAlallsGluTrpArgLysLysLysTyrSerThrGln 

CTTACCGTAAACCCGGTCCCTCA6CG6TATCTTACCTCCTTTTTCTTTATATCGTGTGTT 

^ ?l£f B?S9 G iXkS uA ^ aAs B slnL *ull«HisLeuHisTyrPheAspCysPh0SerGlu 
SI^SSSTSSSCTAGCAGACCAA^ 

CATCTGGGACC66ATCGTCTGGTTGATTAAGTAGACGTAATAAAACTAACAAAAAGTCTT 
???^l?H? , r^? As 2^i? 1 I a, - euG1 y T y rAp 9 Vals « rProA «'9Cy.« G luTyrGlnAla 
AGACGATATTTTTTACGGTATAATCCTATATCTCAATCAG6ATCCACACTTATAGTTCGT 

S^*V0 avr2 t 

5lY^E AsnL y sValsl y SerLeuGlnT y rLeuAl * l -euAlaAlaLeuIleThrProLys 

CCTGTATTGTTCCATCCTA6AGATSTTATGAACC6TGATC6TCGTAATTATT6T6GTTTT 
5W6 binl, 
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SS21 
5581 

; 5641 

5701 

5761 
5821 

5881 

5941 

6001 

■6061 
6121 

6181 

6241 



LyBThrLysProProLeuProSerValLysLyBLeuThrGluAsoAroTpnAeoi w~d..« 

aagacaaJgccacctttgcctagtgttaagaaact^ 
ttctgtttcggtggaaacggatcacaattctttgactgtct^^^ 

SiPA^?T^ l ry5 G1 Y HisAr 9 G1 y SerHis ThrMetAsnGl vHisAM 

CAGAAGACCAAGGGcCACAGAGGGAGCCATACA ATGAATGGACACTAGAGCTTTTArArr 
GTCTTCTGGTTCCCGGTGTCTCCCTCGGTATGTTACTT^ 

5583 mball , 

0^TTt$5$5$5^ SCTGTTAGACATTTT CCTAGGCCATGGCTCCATAGCTTAGGACAAT 
.TCGAATTCTCTCTTCGACAATCTGTAAAAGGATCCGGTACC^ 

5643 aflll, 5670 avr2, 5676 ncol , 

6T£ T £ TA ^ GAAA C TTAT GGGGATACTTGGGCAGGAGTGGAAGCCATAATAAGAATTrTKr 

TATAGATACTTTGAATACCCCTATGAACCCGTCCTCACCTTCGGTATTATT^ 

5752 ecorl . 

5836 mboll, 5862 xbal, 

6GTCCTTCAGTCGGATCCTGACGAACATTGTTAACGATAACATTTTTCACAACGAAAGTA 
5893 avr2, 

5945 mlul, 5988 mball, 

6008 nboll, 6011 sacl , 6016 mstll, 6038 hindlll, 
6147 seal, 



MetLys 



6222 mball, 

I- wmmmmmmmmmm 

6410 nd.l, 

&481 GTAGTATTGGgXaATGTGAcSga^ 

6541 a TGCAgIaGGA?ATAATCAGTTTAT^ 

TACSTCCTCCTATATTAGTCAAATACCCT^ 
6567 binl, 
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ENV 





&601 

v 

S721 
S781 

&841 

>901 
6961 
7021 

7081 

7141 

7201 
7261 
7321 

7381 
74*1 
7S01 
7S61 




6615 ahalll, 



A ATTGGAAAGAAGAAATAAAAGGAGAAATAA A AAACTGCT CTTTC A AT AT CA rrir a Irr 
TTAACCTTTCTTCTTTATTTTCCTCTTTATTTTTT^I^ 



6670 mboll, 



1 1 eArqAspLvsI leGlnLysGluAsnAl aLeuPheAroAsnLeuA«aUai u,,p r „n 0 

ataagXgaJaJgattcagaLgaaaatgcactttt^ 

TATTCTCTATTCTAAGTCTTTCTTTTACGTGAAAAAGCATT6GAACTACATCATGGTTAT 



£?? A ?5£i£??i: ThrThrT hrAsnTyrThrAsnTyrArqLeuI 1 eH is CvsAsnAraSer 

CTATTACGATCATGATGATGGTTGATATGGTTGATATCCAACTATGTAACATTGTCTAGT 
6790 seal, 

Val HeThrGlnAlaCysProLysValSerPheGluProI lpPmiipHi =Tv>^rw=TK^ 

GTCATTACACAGGCCTfeTCCAAAGGTATCATTTGAGCCAATTCCCA^ 

CAGTAATGTGTCCGGACAGGTTTCCATAGTAAACTCGGTTAAGG^ 

6851 stul, 

P?o A laSlyPheAlaUeLeuLysCys^ 

7042 mboll, 7045 mboll, 7060 bglll, 

TTACGATTTTGGTATTATCATGTCGACTTACTTAGACATCGTTAATTGACATGTTCTGGG 
7102 pvull, 1 

7199 mboll , 

TCTTATTATCCTCTATATTCTTTTCGTGTAACATTGTAATCATCTCGTGTTACCTTATTG 
ACTTTAfiiXrirATAW^ 

TGAAATCTTGTCTATCA ATTTTTTA^ 

i tiAAA r CTTGTCTATCAATTTTTTAATTCTCTTGTCAAACCCTTATTATTTTGTTATCAG 
7331 rastll, 
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.7421 
7681 

77*1 
7801 
7861 
7921 
,7981 

8041 

8101 
8161 

8221 
8281 
83*1 

8401 

S 

8461 
8521 




TG6CTCCA6AAGTCTGGACCTCCTCCTCTATACTCCCT6TTAACCTCTTCACTTAATATA 
7628 mboll, 

I I I ATATTTCATTATTTTTAACTT6GTAATCCTTATCGTG66T6GTTCCGTTTCTCTTCT 
7736 mboll, 

CTTTCTATGGATTCCCTAGTTGTCGAGSATCCCTAAACCCCAACSAGACCTTTT^ 
7989 mstll, 7995 binl, 8007 avr2, 

f5E A ?D L y sSerLeuS l"AsplleTrp 
AGTAATAAATCTCTGGAAGACATTTGS 
CTCATTATTTAGAGACCTTCTGTAAACC 
8089 mboll, 

8170 mboll, 

mmmmmmmimmm 

8400 aval, 

8431 mboll, 8434 mboll, 

*i?l£g 6lu A»pLeuArqSerLeu 




8S03 mboll, 8S0S bglll, 
8S25 mboll, 
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asai 

3641 
,3701 

•3761 

3822 
3882 

,^8942 
9002 

9062 

9122 

9182 

9242 

9302 
9362 

9422 
9482 




ATTCTGGGGCACAG-GSGGTGGGAAGCCCTCAAATAT^ 

TAAGACCCCGTGTCCCCCACCCTTCGGGAGTTTATAACCACCTCAGAGGACGTCATAACC 
8629 pstl, 

H?51?5iftS uLysAsnSerAlaValSerT PpLeuAsnAlaThrAlaHeAlaValThr 
$II55§§ A 5 CTAAA6AA TAGTGCTGTTAGCTGGCTCAACGCCACAGCTATAGCAGTAACT 
TAAGTCCTTGATTTCTTATCACGACAATCGACCGAGTTGCGGTGTCGATATCGTCAT^ 

^"GlyThrAspArgVal IleGluValAlaGlnArgAlaTyrApgAlalleLeuHisIle 
S A SSSS A S A §ATAGflGTTATAGAAGTAGCACAAAGAGCTT^ 

CTCCCCTGTCTATCCCAATATCTTCATC6TGTTTCTCGAATATCTCGATAAGAGGTGTAT 

£i? Ar -S A S9HS A £9 GlnG1 y. L euGluAp gLeuLeuLeuOC MetGlvGl vLvsTroSeV 

r A I A ^^$ A II$§5Sa GGG CTTGGAAAGGCTTTTGCTATAAGATGGGtGG?AXiTG^ 

6TATCTTCTTAATCTGTCCCGAACCTTTCCGAAAACGATATTCTACCCACCGTTCACCAGT 

8765 mboll, 





8883 tthllll, 8916 aval xhol , 

Ji?I^A???f5^ As D ThrAlaAlaThrA snAlaAspCysAlaTroLeuGluAlaGlnGlu 

A IC A CAAGTAGCAATACAGCAGCTACTAATGCTGATT6TGCCTG6C^ 

TAGTGTTCATCGTTATGTCGTCGATGATTACGACTAACACGGACCGATCTTCGTGTTCT^ 



etThrTyrLys 



rlrr l^l^^i5i^£tJ eProValAr 9 ProG lnValProLeuArgProM 

gaggaagag H5§5' TTTCCAGTCAGa cctcaggtaccttta^ 

CTCCTTCTCCACCCAAAAGGTCAGTCTGGAGTCCATGGAAATTCTGGTTACTGAA^ 
9005 mboll, 9029 instil, 9031 kpnl. 




9085 ahalll, 

AGGGTTTCTTCTGTTCTCTAGGAACTAGACApTAGATGGTGTGTGTTC^ 
9129 mboll, 9153 binl, 

rf l?»f 2l y ?: ThrProG1 X ProG1 y I1 eArgTyrProLeuThrPheGlvTrDCvs 

r$II^^5 AATTACACA CCAGGGCCAGGGATCAGATftTCCACTGACCT^ 
CTAACCGTCTTAATGTGTGGTCCCGGTCCCTAGTCTATAGGTGACTGGAAACCT^ 
9210 binl, 9216 ecorS , 

T^r^rr?Ar?l^?^$i5iy^ oSluL Y sValG1 u G lu A l aA snGluGlyGluAsnAsn 

IIrT?^I$§I$S£5§T TGAGCCAGASAAG GTAGAAGAGGCCAATGAAGGAGAGAACA^ 

AAGTTCGATCATGGTCAACTCGGTCTCTTCCATCTTCTCCGGTTACTTCCTCTCTTGTTG 

9275 mboll " * 

i 

|"L?"!=?«HisProMetSerLeuHisGly^ 

TrrIIrIlT^SSI$IS$§S£ TGCATGGGATGGAGGA CGCGGAGAftAG^ 
TCGAACAATGTGGGATACTCGSACGTACCCTACCTCCTGCGCCTCTTTCTTCACAATCAC 

TrP A r??5?^fBf?£V ys tS uAlaPneHi *HisMetAlaApgGluLeuHisPpoGluTvp 
a1cTCCAAACTGTCSTTt£I^^^ 

«v.v. i tCAAACTGTCGTTTGATCGTAAAGTAGTGTACCGGGCTCTCGACGTAGGCCTCATG 
9399 aval, 9417 .seal , J" . 

TyrLysAspCysOP , 

9536 pvull , 
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j 



7602 



5542 GAaIIIcggIcI^^ 

SAAAAACGGACATQTNCCCAGAGAGACCAATCTGGTCTAGACTCGGACCCTCGAQAGACCG 

9S76 bgllt, 9S90 sacl , 

ATTGATCCCTTGGGTGACGAATTCGGAGTTATTTCGAACGGAACTCACGAAGTTCATCAC 
9620 •flit, 9434 hindlll, 

9662 TSTGCCCGTCTGTTGTGTGACTCTGGTAACTAGAGATCCCTCAGACCrTTTTArTrArT/- 
j ACACGGGCAGACAACACACTGAGACCATTGATCTCTAGGGAGTCTGGG^ 



A 

9722 TGGAAAAATCTCTAGCAG 
ACCTTTTTAGAGATCGTC 
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ARV GAG pl6 - synthetic Parts A and B 

TflTTATGCAAAGAGGTAACTTCAGG 
ftTAATACGTTTCTCCATTGAAGTCC 

10 mnll, 23 hinll, 
76 ddei, 88 ban2 hgiA hgiJ11 saol sdul , 89 alul 
llrtl* tE2 ' 131 hph » 1A8 raaI ' 161 178 bglll Kho2/l%9 

183ball cfrl hael, 184 haelll, 199 acd , 204 apyl ecorl! sc 
249 mboll, 267 hph, 270 mboll, 

303 ccaa^J&agIa^^^ 

GGTTAGCTGTTCCTTAACATGGGTAACTGGAGAAA^ 

f^^ll^S^al^Jo?? 11 ' 339 * h ° 2 ' 3 *° s * u3 *> 3 5 7%au 3 ; 

... SerSerGlnOP All 
363 TCGAGCCAATGATAG ' 

AGCTCGGTTACTATCAGCT 

363 taq1, 377 accl hindll sail 
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5880 



62 



122 



39*0 



9oco 



182 



242 



302 



362 



422 



482 



542 



602 



662 



AGGXAACAG: 
TCCXTTGTC: 



Met II eAspLysAlaGlnGluGluHisGluLysTyrHisSerAsnTrp 
: ATGAT : GA : AAGGCACAAGAAGAACATGAGAAATATCACAGTAATTGti 
: TACTA : CT : TTCCGTGTTCTTCTTGTACTCTTTATAGTGTCATTAACC 



32 jnball , 38 nlalU, 

ArgAlaMetAlaSerAspPheAsnLeuProProVal Val A 1 aLysGluI leVal Al aSer 
AGAGCCATGGCTAGT6ATTTTAACCTGCCACCTGTAGTAGCAAAAGAAATAGTAGCCAGC 
TCTCGGTACC6ATCACTAAAATTGGACGGTGGACATCATCGTTTTCTTTATCATCGGTCG 



66 ncol , 67 nlalll, 118 nspBII pvull, 119 alul, 

CysAspLysCysGlnLeuLysGlyGluAlaMetHisGlyGlnVal AspCysSerProGl y 
TGTGATAAATGTCAGCTAAAAGGAGAAGCCATGCATGGACAAGTAGACTGTAGTCCAGGA 
ACACTATTTACAGTCGATTTTCCTCTTCGGTACGTACCTGTTCATCTGACATCAGGTCCT 



135 alul, 151; 
76 apyl bstXl 1 



nlalll , 152 nsil 
ecarll scrFl, 



ava3, 155 nlalll. 164 accl 



I leTrpG InLeuAspCysThrHisLeuGluGlyLysI lei leLeuVal AlaValHisVal 
ATATGGCAACTAGATTGTACACATCTAGAAGGAAAAATTATCCTGGTAGCAGTTCATGTA 
TATACCGTTGATCTAACATGTGTAGATCTTCCTTTTTAATAGGACCATCGTCAAGTACAT 

198 rsal, 205 xbal, 223 apyl ecorll scrFl , 236 nlalll, 

AlaSerGlyTyrlleGluAlaGluVallleProAlaGluThrGlyGlnGluThrAlaTyr 
GCCAGTGGATATATAGAAGCAGAAGTTATTCCAGCAGAGACAGGGCAGGAAACAGCATAT 
CGGTCACCTATATATCTTCGTCTTCAATAAGGTCGTCTCTGTCCCGTCCTTTGTCGTATA 

263 xmnl , 

PheLeuLeuLysLeuAl aGlyArgTrpProValLysThrlleHisThrAspAsnGlySer 
TTTCTCTTAAAATTAGCAGGAAGATGGCCAGTAAAAACAATACATACAGACAATGGCAGC 
AAAGAGAATTTTAATCGTCCTTCTACCGGTCATTTTTGTTATGTATGTCTGTTACCGTCG 

321 mboll, 326 ball cfrl hael , 327 haelll, 357 bbv f nu4hl , 

AsnPheThrSerThrThrValLysAlaAlaCysTrpTrpAlaGlylleLysGlnGluPhe 
AATTTCACCAGTACTACGGTTAAGGCCGCCTGTTGGTGGGCAGGGATCAAGCAGGAATTT 
TTAAAGTGGTCATGATGCCAATTCCGGCGGACAACCACCCGTCCCTAGTTCGTCCTTAAA 

366 hph, 371 seal, 372 rsal, 385 haelll, 386 fnu4h1 nsbll , 4 
05 binl, 406 dpnl sau3a, 

GlylleProTyrAsnProGlnSerGlnGlyValValGluSerMetAsnAsnGluLeuLys 
GGCATTCCCTACAATCCCCAAA6TCAAGGAGTAGTAGAATCTATGAATAATGAATTAAAG 
CCGTAA6G6ATGTTAGGGGTTTCAGTTCCTCATCATCTTAGATACTTATTACTTAATTTC 

423 bsml, 458 hinfl, 

LysIlelleGlyGlnValArgAspGlnAlaGluHisLeuLysThrAlaValGlnMetAla 
AAAATTATAGGACAGGTAAGAGATCAGGCTGAACACCTTAAGACAGCAGTACAAATGGCA 
TTTTAATATCCTGTCCATTCTCTAGTCCGACTTGTG6AATTCTGTCGTCATGTTTACCGT 

503 dpnl sau3a, 518 aflll, 530 rsal, 

ValPhel leHisAsnPheLysArgLysGlyGlylleGlyGlyTyrSerAlaGlyGluArg 
GTATTCATCCACAATTTTAAAAGAAAAGGGGGGATTGGGGGATACAGTGCAGGGGAAAGA 
CATAAGTAGGTGTTAAAATTTTCTTTTCCCCCCTAACCCCCTATGTCACGTCCCCTTTCT 

547 fokl, 557 ahalll, 

IleValAspIlelleAlaThrAspIleGlnThrLysGluLeuGlnLysGlnlleThrLys 
ATAGTAGACATAATAGCAACAGACATACAAACTAAAGAACTACAAAAGCAAATTACAAAA 
TATCATCTGTATTATCGTTGTCTGTATGTTTGATTTCTTGATGTTTTCGTTTAATGTTTT 

605 accl , 

1 1 eGl nAsnPheArgValTyrTyrArq AspAsnLysAspProLeuTrpLysGly ProAl a 
ATTCAAAATTTTCGSGTTTATTACA6SGACAACAAA6ATCCCCTTTGGAAAGGACCAGCA 
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uysi.eui.eu I rpcysb i y<a iuu iyAlava l vc 1 1 1 eGlnAspAsnSerAspI l eLysVa 1 
7 22 AASCTTCTCTGGAAAGGtGAAGGGGCAGTAGTAATACAAGATAATAGTGACATAAAAG.TA ' 
TTCGAAGAGACCTTTCCACTTCCCCGTCATCATTATGTTCTATTATCACTGTATTTT CAT 

722 hindlll, 723 alul, 737 hph, 

ValProArgArgLysAlaLysI lelleArqAspTyrGlyLysGlnMetAlaGlyAspAsp 
•Y&O 782 GTGCCAAGAAGAAAAGCAAAAATCATTAGSGATTATGGAAAACAGATGGCAGGTGATGAT 

CACGGTTCTTCTTTTCGTTTTTAGTAATCCCTAATACCTTTTGTCTACCGTCCACTACTA 

789 mboll , 833 hph, 

CysValAlaSerArqGlnAspGluAspAM 
#£OG a * 2 TGTGTGGCAAGTAGACAGGATGAGGATTAGTCGAC6GAATTCTTTAGTAAAACACC 

ACACACCGTTCATCTGTCCTACTCCTAATCAGCTGCCTTAAGAAATCATTTTGTGG 

B52 accl, 859 f oki , 863 mnll, 871 accl hindll sail, 872 taql 
, 878 ecorl , 
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St^aT^L7sAlaValC7S^alI.euL7sGl7AsrGlvProVaIGInGl7lleIleA3a 
1 CATGGCGACGAAGGCCGTG7GCGTGCTGAAGGGCGA 

G^CTGCTTCCGGGACACGGACGACTTCCCCCTGCCGGGTC^CGTCCCGTAGTAGTTA 

PheGluGlnL7sGluSerAsnfil7ProValL7sValTrpGl7SerIIsL7sGl7l^uThr 
62 TTCGAGCAGAAGGAAAGTAATGGACCAGTGAAGGTGTGGG^ . . . 

AAGCXCSTCrrCCTrrCAKaCCTGGTC\CTTCC^ 

GIuGl7LeuHisGl7PheHis t /alHisGluPheGlvAsoAsnThrAlaGl7C7sThrSer 
122 GAAGGCCTGCATGGATTCCATGTTCATGAGTTTGGAGATAATAC\GCAGGCTGTACG^GT 
CTTCCGGACGTACCTAAGGTAC\AGTACTCAAACCTCTATTATGTCGTCCGACATGGTCA 

AlaGl7PrcHis?heAsaPrcLeuSerArgLvsKisGlyGl7ProL7sAspGluGluArg 
132 GCAGGTCCTCACTTTAATCCTCTATCC^GAA^CACGGTGGGCCaAAGGATGAAGAGAGG 
CGTCCAGGAGTGAAATTAGGAGATAGGTCTTTTGTGCCACCCGGTTTCCTACTrCTCTCC 

HisValGlvAs3LeuGl7AsnValThr^^ 
242 CATGTTGGAGACTTGGGC\AXGTGACTGCIGAC\AAGATGGTGTGGCCGATGTGTCTAXT 
GTACAACCTCiGAACCCGTTACACTGACGACTGTTTCTACCAC\CCGGCTAC\CAGATAA 

GluAsoSer , / , alIleSerLeuSerGlvAsaHisC7srleIleGlTArgThrleuValVal 
302 GAAGATTCTGTGATCTCACTCTCAGGAGA^ 

CTTCTAAGACACTAGAGTGAGAGTCCTCTGGTAACGTAGTAACCGGCGTGTGACCACCAG 

HisGluLvsAlaAscAsoLauGlvLvsGl T /Gl7AsnGluGluSerrhrl7sThrGl7Asn 
362 CATGAAAAAGCAGATGACTTGGGC\^ 

GT 2 t £T M ?"^ ,m ''*G"**C*^" AC^GA \Q m ?*^''^ l ~£. rr, ™ f ~ , f m \Q£? T * f ±** \c* T " T *C'^ M r'^ , CA r, *GT mr ^C*^GTCCXXXG 

ZNV 53 

AlaGlvSerA^rLeuAlaCvsGlyVallleGiyll^ 
422 GCTGG^GTCGTTTGGCTTGTGGTGTAATTGGGAXCGCCATGGCTATCGAAGCTCAACA^ 
CGACCTTCAGCAAACCGAACACCACATTAACCCTAGCGGTACCGATAGC'TTCGAGTTGTT 

HisLeuLeuGlnLeuThrYalTrpGlvIleL7sGlnLeuGlaAlaArgValLeuAlaVal 
432 CACTTGCTGCAGTTGACCGTTTGGGGTATCAAGCAGTTGCAGGCTAGAGTTTTGGCTGTT 
GTGAACGACGTCAACTGGCAAACCCCATAGTTCGTCAACGTCCGATCTCAAAACCGACAA 

GluArgTyrle-JL^rzAscGlaGlnLeuLeuGlvIleTrpGlTCysSerGlvLysLeuIle 
542 GAAAGATACTTGAGAGATCAACAATTGTTGGGTATCTGGGGTTGXTCTGGTAAGTTGAXT 
CTTTCTATGAACTCTCTAGTTGTTAACA^CCCATAGACCCC\AC\AGACCATTCAACTAA 

CysThrXhrAlaVaiProTrrAsn^ 
60 2 TGTACCACCGCTGTTCCCTGGAACGCTTCTTGGTCTAACAAGTCTTTGGAAGACATCTGG 
ACATGGTGGCGACAAGGGACCTTGCGAAGAACCAGATTGTTCAGAAACCTTCTGTAGACC 

AspAsaMetTlirTrcMei^lnTrpGluAr^ 
652 GACAACATGACCTGGATGCAAXGGGAAAGAGAAATCGACAACTACACCAACACCATCTAC 
CTGTTGTACTGGACCTACGTTACCCTTTCTCTTTAGCTGTrGATGXGGTTGTGGTAGATG 

TJirI.euLeuGluGluSerGlnAsnGlnGlnGluL7S As nGluGlnG luLeuLeuGluLeu 
722 /UZCTTGTTGGAGGAATCTCAAAACC^ 

TGGAACAACCTCCTTAGAGTTTTGGTTGrc 

AsoLysTrpAlaSerLeuTrpAsnTrpPheSerlleThrAsnTrpAM 
782 GACAAGTGGGCAAGCTTGTGGAACTGGTTCTCTATCACCVACTGGTAG 

CTGTTCACCCGTTCGAACACCTTGACCAAGAGATAGTGGTTGACCATCAGCT 

:i 

Translated Mol. Weight - 30414.22 
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7AV.CGATGTTTCCGA 



1363 



V*Ky*V*lL»uLy*GlyA«pGlyProValGlnGlyllel I «A*nPh«GluGlnLy*Glu 
GTTTGTGTTT7GAAGGGTGACGGCCCAGTTCAA6G7ATTATTAACTTCGAGCAGAAGGAA 
CAAACACAAAACTTCCCACTGCCGGGTCAAGTTCCATAATAATTGAAGCrCGTCTICCTT 

SepA«oGlyProV-ill.y»ValTrpGlySerIl«Ly«GlyL*uThrGluGlyL«u(U*Gly 
K.43 AGTAATGGACCAGrGAAGGTGTGGGGAAGCATTAAAGGACTGACTGAAGGCCTGCATGGA 
TCATTACCTGGTCACTTCCACACCCCTTCGTAATTTCCTGACTGACTTCCGGACGTACCT 

Ph*Mi*y*IHi*GtuPh«GlyA*pA*nThrAl 4 GlyC y sThrSerAlaGlyProHi»P»»« 
TTCCATGTTCA T G AG T T TGGAGA T A A T ACAGCAGGCTGTA C C AG T GCaGGT CC T CAC T 
fAAGGTACAAGTACTCAAACCTCTATTATGTCGTCCGACATGGT c A CG T CCaGGaG T GAAA 

AsnProL«uSerArqLy*Hi«61yGlyPpoLy*A6pGl«GluArqHi S ValGlyA*pUeu 
AATCCTCTATCCAGAAAACACGGTGGGCCAAAGGATGAAGAGAGGCATGTTGGAGACTTG 
TTAGGAGATAGGTCTTTTGTGCCACCCGGTTTCCTACTTCTCTCCGTACAACCTCTGAAC 

Gt y A«nValThrAl*AspLy«AspGlyValAlaAspV*lSecl leGluAspSerVaUle 
GGCAATGTGACTGCTGACAAAGATGGTGTGGCCGATG7GTCTATTGAAGATTCTGTGATC 
CCGTTACACTGACGACTGTTTCTACCACACCGGCTACACAGATAACTTCTAAGACACTAG 

S*rL«uSerGlyA*pHi*Cy«HeIleGlyArqThrL»uV4lValHisGluUy*AlaAsp 
TCACTCTCAGGAGACCATTGCATCATTGGCCGCACACTGGTGGTCCATGAAAAAGCAGAT 
AGTGAGAGTCCTCTGGTAACGTAGTAACCGGCGTGTGACCACCAGGTACTTTTTCGTCTA 

AspLeuGlyLysGlyGlyA*nGluGluSerThrLysThrGlyAsnAlaGlySerArgL«u 
GACTTG3GCAAAGGTGGAAATGAAGAAAGTACAAAGACAGGAAACGCTGGAAGTCGTTTG 
CTGAaCCCGTTTCCACCT TTaCTTCT TTCATGTT T CTGTCCTTTGCGACCTT CaGCAAAC 

linker — > P 3t > 

AtaCy5GlyV*llleGlylleAl*GlnAsnSerGlyValGlyAl*rtetAl*rietAlaSer 

1303 rrTTftT '' r/TTftTAA TT."/:rtATrfircCAGAATTCAGGTGT TGGAGCCATGGCCA TGGCTAGT 
rr T , a r,rr A rATT^ A rrrTAr,ffif,GTCTTAA GTCCACAACCT CGGTACCGGTACCGATCA 

A S p?M.A*nL«uProProVilValAlaLysGluLl#ValAlaSerCysAspLysCysGln 

1843 GATT77AACCTG C CA CCTGT AGT AGCAA AAG AAA T AGTAG C C AGC TGT GAT AAATGTCAG 
CTAAAATTGGACGGTGGACATCATCGTTTTCTTTATCATCGGTCGACACTATTTACAGTC 

HuLy*GlyGiuAl*H*tHi«6Iy61nV*lA*pCysS€PPpo61y 1 1 eTrpG 1 nLeuA* p 
1923 CTAAAAGGAGAAGCCA7GCATGGACAAGTAGACTGTAGTCCAGGAATATGGCAACTAGAT 
GA7T7TCCTC7TCGGTACGTACCTGTTCATCTGACATCAGGTCCTTATACCGTTGATCTA 

CvsThcHi^LeuGluGlyUysIlell.LeuValAlaValHisValAlaSerGlyTyrlle 
7G 7 ACA Ca T CTaG AAGG AAAA A 7 t atcc tggt agcagtt CaTGTAGCCaGTGGATATaTA 
ACA7G7G7AGAT C TT CCT T7T7 AATAGGAC CA7CGT CAAGT AC AT CGG7 CA CCTAT AT AT 

GluAIaGluVilIleProAlaGIuThcGlyGlnGluThrMa^ 

GAAGCAGAAG7TA7TCCAGCAGAGACAGGGCAGGAAACAGCATATT77CTCTTAAAATTA 
C77C3TC7TCAA7AAGGTCGTCTCTGTCCCGTCCTTTGTCGTA7AAAAGAGAATTTTAAT 



1S03 



tS63 



1&23 



1633 



17^: 



2CK3 



21a: 



2i a: 



2223 



AlaGlyArq7rpProValLysThrlleHisThpA* P AsnGlySerAsnPheThrSerThr 
G C AG5A AG A TGG CCAG T AAAAACA A T ACA T AC A6ACAAT GG CAGCAA T T T CACCAGT AC T 
CG7CCT7C7ACCGG7CA77T7TGTTATGTATGTCTGTTACCG7CGT7AAAGTGGTCATGA 

ThrValUysAlaAlaCysTrpTrpAlaGlylleLysGlnGluPheGlylleProTyrAsn 
AC5G77AAGGCCGCC7GTTGGTGGGCAGGGAT CAAGCAGGAAT TT GGCATTCCCT ACAAT 
7GCCAAT7CCGGCGGACAACCACCCG7CCCTAGTTCGTCCTTAAACCGTAAGGGATGTTA 

PraGlnS«rGlnGlyV*lValGluSerneLA*nAsnGluL«uLysLy«U«neGlyGln 
CC C CAAAGTCAAGGA6T AGT AGAAT C T ATGAA TAATGAATT A A AG AA AAT T ATA6GACAG 
G5GG7TTCAGT TCCTCATCATCTTA6ATACTT ATTACTT AATTTCTTT TAATATCCT6TC 

V*UrgA«pGlnAlaGluHi,t.uLysThrAlaValGlnrietAl^ 
2263 G7 AAGAGATCAGGCTGA AC ACCT T A AGAC AGC AG T ACAAAT GG CAGT AT T CAT C CACAAT 
CAT7CTCTAGTCCGACTTGTGGAATTCTGTCGTCATGTTTACCGTCATAAGTAGGTGTTA 

PheLy^AcgUysGlyGlylleGlyGlyTyrS.rAlaGlyGluArgll.ValAspllell. 
T77AAAAGAAAAGGGGGGA7TGGGG6ATACAG7GCAGGGGAAAGAATAGTAGACATAATA 
AAA7TTTCTTTTCCCCCCTAACCCCCTATGTCACGTCCCCTTTCTTATCATCTGTATTAT 

AlaThrAs P tl.Gl«ThrLy*GluL.uGlnLy*GlnIleThcLysI^ 

GCaaCaGaCA TaCa aaC TaaaGaaC T aCaaaaGCaaaT T ^****^^^5g£ 
C57TGTCTG7ATGTTTGATTTCTTGAT6TTTTCGTTTAATGTTTTTAAGTT7TAAAAGCC 

V4lTyrTycAcgAspA*nLy.A,pProL«uTrpLy*GlyProA^ 
GTTTATTACAGGGACAACAAAGATCCCCTTTGGAAAGGACCAGCAAAGCTTCTCTGGAAA 
CAAATAATG7CCCTGT TGT TT CTAGGGGAAACCTT TCCT6GT CGTTTCGAAGAGACCTTT 

GlyGl«GlyAl*VaIValIl.GloA. P A*nS.rA* P M.Ly.ValValProArqArgLy. 
GGTGAAGGGGCA6TAGTAATACAAGATAATAGTGACATAAAAGTAGTGCCAAGAAGAAAA 

ccacttccccstcatcattatgttctattatcactgtattttcatcacggttcttctttt 

Al*LysIl«U«AcqA«pTyrGlyLy«Glnn«tAlaGlyA*pA«pCy*ValAl*S«rApg 

2S83 gcaaaaatcattagggattatggaaaacagatggcaggtgatgattgtgtggcaagtaga 
cgtttttagtaatccctaataccttttgtctaccgtccactactaacacaccgttcatct 

G I nA* pG 1 uAs pAn 
26^3 caGGaTGaGGATTaG 

gtcctactcctaatc FIGURE 24 
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Argument Map in DNA Strand ssarv2 
from the '/v/lib/6mers'file. 
Translation shown at open reading frames. 
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ii ii 
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mboll-1 mboll-1 
binl aval-2 
binl seal 
ecor5 



bglll 
sacl 
af 111 
hindlll 



narl 
sacl 
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binl 
mboll-1 
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i 
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. !__! !__!_! ! ! !_!__!__« !_! 

avr2 mbbll-1 ecorl avr2 mboll-1 
binl af 111 mboll-1 mboll-1 

avr2 xbal sacl 

ncol mlul hindlll 

mstll 



ndel avr2 
seal 



_! ! ! !__!__! !__! ! !_!__ 

seal ndel binl mboll-1 stul mboil-1 

mboll-1 ahalll seal mboll-1 

bglll 
pvull 

___! ! ___! ! j i j 

mboll-1 mboll-2 mstll 

mstll mboll-1 binl 

avr2 

mboll-1 

! !!!___!! ! ! \_\ !_!__! ! \ _ 

mboll-1 aval-2 pstl mboll-1 "aval-i ahalll 

mboll-1 tthIIIl-2 mboll-1 

mbol ^Kin i xho1 m stII binl 

pboll-1 mboll-1 

b ^oll-2 kpnl 

_! !__! ! i !__,! 1 1 1_ 

ecor 5 aval-2 pvull 

mboll-1 bglll 
... seal sacl 

binl af in 

hindlll 
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1 CTGGAAGGGCTAATTTGGTCCCAAAGAAGACAAGAGATCCTTGATCTGTGGATCTACCAGA'C 
GACCTTCCCGATTAAACCAGGGTTTCTTCTGTTCTCTAGGAACTAGACACCTAGATGGJGTG 

26 mboll, 50 binl , r i : ■[:.: 

63 ACAAGGCTACTTCCCTGATTGGCAGAATTACACACCAGGGCCAGGGATCAGATATCCAGT 
TGTTCCGATGAAGGGACTAACCGTCTTAATGTGTGGTCCCGGTCCCTAGTCTATAGGTGA v- 

107 binl, 113 ecor5, 

123 GACCTTTGGATGGTGCTTCAAGCTAGTACCAGTTGAGCCAGAGAAGGTAGAAGAGGCCAA 
CTGGAAACCTACCACGAAGTTCGATCATGGTCAACTCGGTCTCTTCCATCTTCTCCGGTT 

172 mboll, 

183 TGAAGGAGAGAACAACAGCTTGTTACACCCTATGAGCCTGCATGGGATGGAGGACGCGGA 
ACTTCCTCTCTTGTTGTCGAACAATGTGGGATACTCGGACGTACCCTACCTCCTGCGCCT 

243 GAAAGAAGTGTTAGTGTGGAGGTTTGACAGCAAACTAGCATTTCATCACATGGCCCGAGA 
CTTTCTTCACAATCACACCTCCAAACTGTCGTTTGATCGTAAAGTAGTGTACCGGGCTCT 

296 aval, 

303 GCTGCATCCGGAGTACTACAAAGACTGCTGACATCGAGCTTTCTACAAGGGACTTTCCGC 
CGACGTAGGCCTCATGATGTTTCTGACGACTGTAGCTCGAAAGATGTTCCCTGAAAGGCG 

314 seal, 

363 TGGGGACTTTCCAGGGAGGCGTGGCCTGGGCGGGACTGGGGAGTGGCGTCCCTCAGATGC 
ACCCCTGAAAGGTCCCTCCGCACCGGACCCGCCCTGACCCCTCACCGCAGGGAGTCTACG 

423 TGCATATAAGCAGACTGCTTTTTGCCTGTACTGGGTCTCTCTGGTTAGACCAGATCTGAG 
ACGTATATTCGTCTGACGAAAAACGGACATGACCCAGAGAGACCAATCTGGTCTAGACTC 

474 bglll, 

483 CCTGGGAGCTCTCTGGCTAACTA^ 

GGACCCTCGAGAGACCGATTGATCCCTTGGGTGACGAATTCGGAGTTATTTCGAACGGAA 

488 sacl, 518 af 111, 532 hindlll, 

639 narl, 
680 sacl, 

783 fmxgmmmsmmmm^ 0 
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Lvsl leArgLeuArgProGlyGlyLysLysLysTyrLysLeuLysHisI leValTrpAla 
843 AAAATTCGGTTAAGGCCAGGGGGAAAGAA^ 

TTTTAAGCCAATTCCGGTCCCCCTTTCTTTTTTATATTCAATTTTGTATATCATACCCGT 

SerArgGluLeuGluArgPheAlaValAsnProGlyLeuLeuGluThrSerGluGlyCy^ 
903 AGCAGGGAGCTAGAACGATTCGCAGTCAATCCTGGCCTGTTAGAAACATCAGAAGGCTGC 
TCGTCCCTCGATCTTGCTAAGCGTCAGTTAGGACCGGACAATCTTTGTAGTCTTCCGACG 

959 pstl, 

ArgGlnl leLeuGlyGlnLeuGlnProSerLeuGlnThrGlySerGluGluLeuArgSer 
963 AGACAAATATTGGGACAGCTACAGCCATCCCTTCAGACAGGATCAGAAGAACTTAGATCA 
TCTGTTTATAACCCTGTCGATGTCGGTAGGGAAGTCTGTCCTAGTCTTCTTGAATCTAGT 

1002 binl . 1008 mboll. 

LeuTy rAsnThrVa 1 A laThrLeuTy rCysVa lHisGlnArgl leAspVal LysAspThr 
1023 TTATATAATACAGTAGCAACCCTCTATTGTGTACATCAAAGGATAGATGTAAAAGACACC 
AATATATTATGTCATCGTTGGGAGATAACACATGTAGTTTCCTATCTACATTTTCTGTGG 

LysGluAlaLeuGluLysI leGluGluGluGlnAsnLysSerLysLysLysAlaGlnGln 
1083 AAGGAAGCTTTAGAGAAGATAGAGGAAGAGCAAAACAAAAGTAAGAAAAAGGCACAGCAA 
TTCCTTCGAAATCTCTTCTATCTCCTTCTCGTTTTGTTTTCATTCTTTTTCCGTGTCGTT 

✓\ *\ a 

1087 hindlll, 1097 mboll, 1107 mboll, p 25 



AlaAlaAlaAlaAlaGlyThrGlyAsnSerSerGlnValSerGlnAsnTyrProI leVal 
1143 GCAGCAGCTGCAGCTGGCACAGGAAACAGCAGCCAGGTCAGCCAAAATTACCCTATAGTG 
CGTCGTCGACGTCGACCGTGTCCTTTGTCGTCGGTCCAGTCGGTTTTAATGGGATATCAC 

/•v /\ /N 

1147 pvull, 1150 pstl, 1153 pvull, 1156 tthllll, 

GlnAsnLeuGlnGlyGlnMetValHisGlnAlal leSerProArgThrLeuAsnAlaTrp 
1203 CAGAACCTACAGGGGCAAATGGTACATCAGGCCATATCACCTAGAACTTTAAATGCATGG 
GTCTTGGATGTCCCCGTTTACCATGTAGTCCGGTATAGTGGATCTTGAAATTTACGTACC 

1250 ahalll, 1255 ava3, 

,„„ ValLysyalValGluGlyLysAlaPheSerProGluVallleProMet 
1263 GTAAAAGTAGTAGAAGAAAAGGCTTTCAGCCCAGAAGTAATACCCATGTTTTCAGCATTA 
CATTTTCATCATCTTCTTTTCCGAAAGTCGGGTCTTCATTATGGGTACAAAAGTCGTAAT 

1275 mboll, 

SerGluGlyAlaThrProGlnAspLeuA 
1323 TCAGAAGGAGCCACCCCACAAGATTTAAACACCATGCTAAACACAGTGGGGGGACATCAA 
AGTCTTCCTCGGTGGGGTGTTCTAAATTTGTGGTACGATTTGTGTCACCCCCCTGTAGTT 

1346 ahalll, 

^o, AlfAlaMetGlnMetLeuLysGluThrlleAsnGluGluAlaAlaGluTrpAspArgVal 
1383 GCAGCCATGCAAATGTTAAAAGAGACTATCAATGAGGAAGCTGCAGAATGGGATAGAGTG 
CGTCGGTACGTTTACAATTTTCTCTGATAGTTACTCCTTCGACGTCTTACCCTATCTCAC 

1423 pstl, 

,„ HisProValHisAlaGlyProIleAlaProGlyGlnMetArgGluProArgGlySerAsp 
1443 CATCCAGTGCATGCAGGGCCTATTGCACCAGGCCAAATGAGAGAACCAAGGGGAAGTGAC 
GTAGGTCACGTACGTCCCGGATAACGTGGTCCGGTTTACTCTCTTGGTTCCCCTTCACTG 

1451 sphl, 
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I leAlaGlyThrThrSerThrLeuGlnGluGlnl leGlyTrpMetThrAsnAsnProPro 
1503 ATAGCAGGAACTACTAGTACCCTTCAGGAACAAATAGGATGGATGACAAATAATCCACCT 
TATCGTCCTTGATGATCATGGGAAGTCCTTGTTTATCCTACCTACTGTTTATTAGGTGGA 

I leProValGlyGluI leTyrLysArgTrpIlel leLeuGlyLeuAsnLysI leValArg 
1563 ATCCCAGTAGGAGAAATCTATAAAAGATGGATAATCCTGGGATTAAATAAAATAGTAAGA 
TAGGGTCATCCTCTTTAGATATTTTCTACCTATTAGGACCCTAATTTATTTTATCATTCT 

MetTyrSerProThrSerl leLeuAspI leArgGlnGlyProLysGluProPheArgAsp 
1623 ATGTATAGCCCTACCAGCATTCTGGACATAAGACAAGGACCAAAGGAACCCTTTAGAGAT 
TACATATCGGGATGGTCGTAAGACCTGTATTCTGTTCCTGGTTTCCTTGGGAAATCTCTA 

1636 bStXI, 

TyrValAspArgPheTyrLysThrLeuArgAlaGluGlnAlaSerGlnAspValLysAsn 
1683 TATGTAGACCGGTTCTATAAAACTCTAAGAGCCGAACAAGCTTCACAGGATGTAAAAAAT 
ATACATCTGGCCAAGATATTTTGAGATTCTCGGCTTGTTCGAAGTGTCCTACATTTTTTA 

1720 hindlll, 

TrpMetThrGluThrLeuLeuValGlnAsnAlaAsnProAspCysLysThrl leLeuLys 
1743 TGGATGACAGAAACCTTGTTGGTCCAAAATGCAAACCCAGATTGTAAGACTATTTTAAAA 
ACCTACTGTCTTTGGAACAACCAGGTTTTACGTTTGGGTCTAACATTCTGATAAAATTTT 

1796 ahalll, 

, no, AlaLeuGlyProA 1 aA 1 aThrLeuG 1 uG 1 uMetMe tThrA 1 aCysG InGlyValGlyGly 
1803 GCATTGGGACCAGCAGCTACACTAGAAGAAATGATGACAGCATGTCAGGGAGTGGGGGGA 
CGTAACCCTGGTCGTCGATGTGATCTTCTTTACTACTGTCGTACAGTCCCTCACCCCCCT 

1827 mboll, 



1863 



ProGlyHisLysAlaArgVAlLeuAlaGluAlaMetSerGlnVAlThrAsnProAlaAsn 

PP£5P££$T^t9£ttSt5TTIT^^^aagccatgag'ccaagtaacaaatccagctaac 

GGGCCGGTATTTCGTTCTCAAAACCGACTTCGGTACTCGGTTCATTGTTTAGGTCGATTG 

pl 8 



mo* U?^?^?^]P A /:§GlyAsnPheArgAsnGlnArgLysThrValLysCysPheAsnCys 
1923 ATAATGATGCAGAGAGGCAATTTTAGGAACCAAAGAAAG 

TATTACTACGTCTCTCCGTTAAAATCCTTGGTTTCT TTCTGACAATTCACAAAGTTAACA 

GlyLysGluGlyHIsI leAlaLysAsnCysArgAlaProArgLysLysGlvCvsTrDAra 
1983 GGCAAAGAAGGGCACATAGCa 

CCGTTTCTTCCCGTGTATCGGTTTTTAACGTCCCGGGGATCCTTTTTCCCGACAACCTCT 
2014 apal, 2019 avr2, 

on , , CysGlyArgGluGlyHisGlnMetLysAspCysThrGluArgGlnAlaAsnPheLeuGlv 
2102 mboll, 

2105 ffimmmmmmui^^mm 

2101 bglll, 2111 mboll. 
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ThrAlaProProGluGluSerPheArgPheGlyGluGluLysThrThrProSerGlnlys 
2163 ACAGCCCCACCAGAAGAGAGCTTCAGGTTTGGGGAGGAGAAAACAACTCCCTCTCAGAAG 
TGTCGGGGTGGTCTTCTCTCGAAGTCCAAACCCCTCCTCTTTTGTTGAGGGAGAGTCTTC 

2175 mboll, 

GlnGluProI leAspLysGluLeuTyrProLeuThrSerLeuArgSerLeuPheGlyAsn 
2223 CAGGAGCCGATAGACAAGGAACTGTATCCTTTAACTTCCCTCAGATCACTCTTTGGCAAC 
GTCCTCGGCTATCTGTTCCTTGACATAGGAAATTGAAGGGAGTCTAGTGAGAAACCGTTG 

AspProSerSerGlnOC 
2283 GACCCCTCGTCACAATAAGGATAGGGGGGCAACTAAAGGAAGCTCTATTAGATACAGGA 
CTGGGGAGCAGTGTTATTCCTATCCCCCCGTTGATTTCCTTCGAGATAATCTATGTCCT 



MetAsnLeuProGlyLysTrpLysProLysMetl le 1 
2342 GCAGATGATACAGTATTAGAAGAAATGAATTTGCCAGGAAAATGGAAACCAAAAATGATA 
CGTCTACTATGTCATAATCTTCTTTACTTAAACGGTCCTTTTACCTTTGGTTTTTACTAT 

2360 mboll, 2375 bstXI, 

GlyGlyl leGlyGlyPhel leLysValArgGlnTyrAspGlnl leProValGluI leCys 
2402 GGGGGAATTGGAGGTTTTATCAAAGTAAGACAGTACGATCAGATACCTGTAGAAATCTGT 
CCCCCTTAACCTCCAAAATAGTTTCATTCTGTCATGCTAGTCTATGGACATCTTTAGACA 

GlyHisLysAlal leGlyThrValLeuValGlyProThrProValAsnl lei leGlyArg 
2462 GGACATAAAGCTATAGGTACAGTATTAGTAGGACCTACACCTGTCAACATAATTGGAAGA 
CCTGTATTTCGATATCCATGTCATAATCATCCTGGATGTGGACAGTTGTATTAACCTTCT 

2517 mboll, 

„ MO AsnLeuLeuThrGlnl leGlyCysThrLeuAsnPheProI leSerProI leGluThrVal 
2522 AATCTGTTGACTCAGATTGGTTGTACTTTAAATTTCCCCATTAGTCCTATTGAAACTGTA 
TTAGACAACTGAGTCTAACCAACATGAAATTTAAAGGGGTAATCAGGATAACTTTGACAT 

2548 ahalll, 2577 tthllll, 

ProValLysLeuLysProGlyMetAspGlyProLysValLysGlnTrpProLeuThrGlu 
2582 CCAGTAAAATTAAAGCCAGGAATGGATGGCCCAAAAGTTAAGCAATG^ 

GGTCATTTTAATTTCGGTCCTTACCTACCGGGTTTTCAATTCGTTACCGGTAACTGTCTT 

2627 ball, 2639 mboll, 

GluLysIleLysAlaLeuValGluI leCysThrGluMetGluLysGluGlvLvsI leSer 
2642 GAAAAAATAAAAGCATTAGTAG^ 

CTTTTTTATTTTCGTAATCATCTCTATACATGTCTTTACCTTTTCCTTCCCTTTTAAAGT 
IV?. 1 leGlyProGluAsnProTyrAsnThrProValPheAlal leLysLysLysAsDSer 

TTTTAACCCGGACTTTTAGGTATGTTATGAGGTCATAAACGATATTTCTTTTTTCTGTCA 
2759 seal. 

o7co IPfUfTfR^?U^?yy?l^§eE0 eAr 9GluLeuAsnLysArgThrGlnAspPheTrp 
2762 ACTAAATGGAGAAAACTAGTAGA^ 

TGATTTACCTCTTTTGATCATCTAAAGTCTCTTGAATTATTTTCTTGAGTTCTGAAGACC 

oooo GluValGlnLeuGly I leProHisProGlnGlyOC 
2822 GAAGTTCAGTTAGGAATACCA^ 

CTTCAAGTCAATCCTTATGGTGTGGGCGTCCCAATTTTTTTCTTTTTTAGTCATTGTCAT 
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2882 TTGGATGTGGGTGATGCATACTTTTCAGTTCCCTTAGATAAAGACTTTAGAAAGTATACTG 
AACCTACACCCACTACGTATGAAAAGTCAAGGGAATCTATTTCTGAAATCTTTCATATGAC 

2895 ava3. 

Met A rgH i sG 1 nG 1 yLeuAsp I leSerTh rMe tfrpl POL 
2943 CATTTACCATACCTAGTATAAACAATGAGACACCAGGGATTAGATA 

GTAAATGGTATGGATCATATTTGTTACTCTGTGGTCCCTAATCTATAGTCATGTTACACC 

• 2985 ecor5, 

LeuProG 1 pG 1 yTrpLysG 1 y Ser ProA l.a I lePheGlnSerSerMetThrLysI leLeu 
3003 CTGCCACAGGGATGGAAAGGATCACCAGCAA 

GACGGTGTCCCTACCTTTCCTAGTGGTCGTTATAAGGTTTCATCGTACTGTTTTTAGAAT 
3003 tthllll, 3006 bstXI, 3021 binl, 

™r, GJuProPheArgLysGlnAsnProAspIleVall leTyrGlnTyrMetAspAspLeuTyr 
3063 GAGCCTTTTAGAAAACAGAATCCAGACATA 

CTCGGAAAATCTTTTGTCTTAGGTCTGTATCAATAGATAGTTATGTACCTACTAAACATA 

ValGlySerAspLeuGluIleGlyGlnHisArgThrLysIleGluGluLeuArgGlnHis 
3123 GTAGGATCTGACTTAGAAATAGGGCAGCATAGAACAA^ 

CATCCTAGACTGAATCTTTATCCCGTCGTATCTTGTTTTTATCTCCTTGACTCTGTCGTA 
3126 binl, 3171 tthllll, 

L?uLeuArgTrpGlyPheThrThrProAspLysLysHisGlnLysGluProProPheLeu 
3183 CTGTTGAGGTGGGGATTTACCACACCAGACAAAAAACATCAGAAAGAACCK 

GACAACTCCACCCCTAAATGGTGTGGTCTGTTTTTTGTAGTCTTTCTTGGAGGTAAGGAA 

3234 bstXI, 

I^RUl^IylyGiuLeuHisProAspLysTrpThrValGlnProIleMetLeuProGlij 

t y fA?8§gfIIrPThrVal AsnAspI leGlnLysLeuValGlyLysLeuAsnTrDAiaspr 
3303 AAAGACAGCTGGACTGTCAATGACATACAGAAGTTAGTGGG^ 

TTTCTGTCGACCTGACAGTTACTGTATGTCTTCAATCACCCTTTTAACTTAA^ 
3308 pvull, 

3W mboll, 

35Maham FIG.4G 
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LeuLysThrGlyLysTyrAlaArgMetArgGlyAlaHisThrAsnAspValLysGlnLeu 
3603 CTGAAAACAGGAAAGTATGCAAGGATGAGGGGTGCCCACACTAATGATGTAAAACAGTTA 
GACTTTTGTCCTTTCATACGTTCCTACTCCCCACGGGTGTGATTACTACATTTTGTGAA 

3659 hpal, 

ThrGluAlaValGlnLysValSerThrGluSerl LeVal I leTrpGlyLysI leProLy. 
3663 ACAGAGGCAGTGCAAAAAGTATCCACAGAAAGCATAGTAATATGGGGAAAGATTCCTAAA 
TGTCTCCGTCACGTTTTTCATAGGTGTCTTTCGTATCATTATACCCCTTTCTAAGGATT 

PheLysLeuProI leGlnLysGluThrTrpGluAlaTrpTrpMetGluTyrTrpGlnAla 
3723 TTTAAACTACCCATACAAAAGGAAACATGGGAAGCATGGTGGATGGAGTATTGGCAAGCT 
AAATTTGATGGGTATGTTTTCCTTTGTACCCTTCGTACCACCTACCTCATAACCGTTCGA 

3723 ahalll, 

TDfItRli?^2§l u I rp61uPheValAsnTh '"ProProLeuValLysLeuTrpTyrGln 
3783 ACCTGGATTCCTGAGTGGGAGTTTGTCAATACCCCTCC 

TGGACCTAAGGACTCACCCTCAAACAGTTATGGGGAGGGAATCACTTTAATACCATGGTC 
3835 kpnl, 

70l7 LeyGluLysGluProHeValGlyAlaGluThrPh^ 
3843 TTAGAGAAAGAACCCATAGTAGGAGC 

AATCTCTTTCTTGGGTATCATCCTCGTCTTTGAAAGATACATCTACCCCGTCGATTATCC 

GluThrLysLeuGlyLysAlaGlyTyrValThrAspArgGlyArgGlnLysValValSe, 
3903 GAGACTAAATTAGGAAAAGa 

CI CTGATTTAATCCTTTTCGTCCTATACAATGACTGTCTCCTTCTGTTTTTCAACAGAGG 
3913 mboll, 

-, n r-, !i?Al§AspThrThrAsnGlnLysThrGluLeuGlnAlaI leHisLeuAlaLeuGlnAsn 

3963 

3983 mboll, 

tino* frrr^t?^lyy?I A fPU?y?l ThrAs PSerGlnTyrAlaLeuGlyIleileGlnAla 

,023 SK^^ 



4083 



4060 ava3, 

4163 kpnl , 
4232 seal, 
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GlnGluGluHlsGluLysTyrHisSerAsnTrpArgAlaMetAlaSerAspPheAsnLetj 
1263 CAAGAAGAACATGAGAAATATCACAGTAATTGGAGAGCAATGGCTAGTGATTTTAACCTG 
GTTCTTCTTGTACTCTTTATAGTGTCATTAACCTCTCGTTACCGATCACTAAAATTGGAG 

1266 mboll. 

ProProValValAlaLysGluI leValAlaSerCysAspLysCysGlnLeuLysGIyGlu 
4323 £CACCTGTAGTAGCAAAAGAAATAGTAGCCAGCTGTGATAAATGTC 

GGTGGACATCATCGTTTTCTTTATCATCGGTCGACACTATTTACAGTCGATTTTCCTCTT 

1352 pvull. 

, ,o, AlaMetHisGlyGlnValAspCysSerProGlyl leTrpGlnLeuAspCysThrHisLeu 
1383 GCCATGCATGGAC^GTAGACTGTAG 

CGGTACGTACCTGTTCATCTGACATCAGGTCCTTATACCGTTGATCTAACATGTGTAGAT 

/\ ^ 

1386 ava3, 1110 bstXI, 1139 xbal, 

. GluGlyLysIlelleLeuValAlaValHisValAlaSerGlyTyrlleGluAlaGluVal 
1113 GAAGGAAAAATTATCCTGGTAGC^ 

CTTCCTTTTTAATAGGACCATCGTCAAGTACATCGGTCACCTATATATCTTCGTCTTCAA 
1197 xmnl, 

I leProA laG 1 uThrG lyGlnGl uThr A laTyrPheLeuLeuLvsLeuA 1 aG 1 vA raTm 
1503 ATTCCAGCAGAGACAGGGCAGGAAACAGCATATTTTCTCTT 

TAAGGTCGTCTCTGTCCCGTCCTTTGTCGTATAAAAGAGAATTTTAATCGTCCTTCTACC 

1555 mboll, 1560 ball, 
, ProValLysThrl leHisThrAspAsnGlySerAsnPheThrSerThrThrVal I vsAia 
G^TWTTTTTGTTATGTATGTCTGTTA^ 
1605 seal, 

1623 SaP^M 

CGGACAACCACCCGTCCCTAGTTCGTCCTTAAACCGTAAGGGATGT^ 
1639 binl, 

^y?!y§! GIuSerMetA snAsnGluleuLysLysI lei leGlvGlnVaiAraflcnnin 

cgacttgtggaattctgtcgtcatgtttaccgtcataagtaggtgtUaaatt 
1752 aim. ii791 ahalll, 

FIG. 41 
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AspAsnLysAspProLeuTrpLysGlyProAlaLysLeuLeuTrpLysGlyGluGlyAla' 
4923 GACAACAAAGATCCCCTTTGGAAAGGACCAGCAAAGCTTCTCTGGAAAGGTGAAGGGGCA* 
CTGTTGTTTCTAGGGGAAACCTTTCCTGGTCGTTTCGAAGAGACCTTTCCACTTCCCCGT 

4956 hindlll, 

Valval I leGlnAspAsnSerAspI leLysValValProArgArgLysAlaLysI lei le Ti 
4983 GTAGTAATACAAGATAATAGTGACATAAAAGTAGTGCCAAGAAGAAAAGCAAAAATCATT 
CATCATTATGTTCTATTATCACTGTATTTTCATCACGGTTCTTCTTTTCGTTTTTAGTAA 

5023 mboll, 

, Me tG 1 uA sn A rgTrpG 1 nVa 1 Me 1 1 1 eVa 1 T rpG 1 nVa 1 A spA rgMe t A r g 1 1 e 
ArgAspTyrGlyLysGlnMetAlaGIyAspAspCysValAlaSerArgGlnAspGluAsp 
5043 AGGGATTATGGAAAACAGATGGCAGGTGATGATTGTGTGGCAAGTAGACAGGATGAGGAT 
TCCCTAATACCTTTTGTCTACCGTCCACTACTAACACACCGTTCATCTGTCCTACTCCTA 

ArgTreTrpLysSerLeuValLysHlsHisMetTyrl leSerLysLysAlaLysGlyTrp 
AM 

5103 TAGAACATGGAAAAGTTTAGTAAAACACCATATGTATATTTCAAAGAAAGCTAAAGGATGG 
ATCTTGTACCTTTTCAAATCATTTTGTGGTATACATATAAAGTTTCTTTCGATTT CCTACC 



5281 



5131 ndel, 1 

PheTyrArgHlsHisTyrGluSerThrHisProArgValSerSerGluValHisI le ! 
5163 TTTTATAGACATCACTATGAAAGTACTCATCCAAGAGTAAGTTCAGAAGTACACATC 1 
AAAATATCTGTAGTGATACTTTCATGAGTAGGTTCTCATTCAAGTCTTCATGTGTAG I 

5185 seal, I 

.-oo, ProLeuGlyAspAlaLysLeuVall leThrThrTyrTrpGlyLeuHisThrGlyGluArg ! 
5221 CCCCTAGGGGATGCTAAATTGGTAA^ I 

GG5GATCCCCTACGATTTAACCATTATTGTTGTATAACCCCAGACGTATGTCCTCTTTCT I 
5223 avr2, I 

Mimmmwrnmrnmrn : 

5440 avr2, | 
r-, §iYyi sAsnl -ysValGlySerLeuGlnTyrLeuAlaLeuAlaAiai phi iPThrPmi uc I 

5476 binl, ' 

5521 wmmmmmusmmi 

FIG. 4J 1 
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GlnLysThrLysGlyHisArgGlySerHisThrMetAsnGlyHisAM 
5581 CAGAAGACCAAGGGCCACAGAGGGAGCCATACAATGAATGGACACTAGAGCTTTTAGAGG 
GTCTTCTGGTTCCCGGTGTCTCCCTCGGTATGTTACTTACCTGTGATCTCGAAAATCTCC 

5583 mboll, 

5611 AGCTTAAGAGAGAAGCTGTTA^ 

TCGAATTCTCTCTTCGACAATCTGTAAAAGGATCCGGTACCGAGGTATCGAATCCTGTTA 

5613 af 111, 5670 avr2, 5676 ncol, 

5701 ATATCTATGAAACTTATGGGGATAa 

TATAGATACTTTGAATACCCCTATGAACCCGTCCTCACCTTCGGTATTATTCTTAAGACG 

5752 ecorl , 

5761 AACAACTGCTGTTTATTCATTTCAGAATTGGGTGTCAACATAGCAGAATAGG 

TTGTTGACGACAAATAAGTAAAGTCTTAACCCACAGTTGTATCGTCTTATCCGTAATAAG 

5821 AACAGAGGAGAGCAAGAAGAA^ 

TTGTCTCCTCTCGTTCTTCTTTACCTCGGTCATCTAGGATTAGATCTCGGGACCTTCGTA 

5836 mboll, 5862 xbal, 

5881 CCAGGAAGTCAGCCTAGGACTGCTTGTAACAATTGCTATTGTAAAAAGTGTTGCTTTCAT 
GGTCCTTCAGTCGGATCCTGACGAACATTGTTAACGATAACATTTTTCACAAC^ 

5893 avr2, 

5915 mlul. 5988 mboll, 

6001 ^^ifw^^st^m^^iw 

6008 mboll, 6011 sacl, 6016 mstll, 6038 hindlll, 

6061 mm^mmMum^mmm^ 

TATTATCGTTATCAAC^^ 
6117 seal, 



6181 mmmmmmmm^mM 

6222 mboll, 

FIG. 4K 
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ValTrpLysGluAlaThrThrThrLeuPheCysAlaSerAspAlaArgAlaTyrAspThr 
6361 GT6TGGAAAGAAGCAACTACCACTCTATTTTGTGCATCAGATGCTAGAGCATATGATACA: 
CACACCTTTCTTCGTTGATGGTGAGATAAAACACGTAGTCTACGATCTCGTATACTATGT- 

6410 ndel, 

GluValHisAsnValTrpAlaThrHisAlaCysValProThrAspProAsnProGlnGlu 
6121 GAGGTACATAATGTTTGGGCCACACATGCCTGTGTACCCACAGACCCCAACCCACAAGAA 
CTCCATGTATTACAAACCCGGTGTGTACGGACACATGGGTGTCTGGGGTTGGGTGTTCTT 

Va 1 Va 1 LeuG lyAsnVa 1 ThrG luAsnPheAsnMetTrpLysAsnAsnMe tVa 1 G 1 uG 1 n 
6181 GTAGTATTGGGAAATGTGACAGAAAATTTTAACATGTGGAAAAATAACATGGTAGAACAG 
CATCATAACCCTTTACACTGTCTTTTAAAATTGTACACCTTTTTATTGTACCATCTTGTC 

MetGlnGluAspI lei leSerLeuTrpAspGlnSerLeuLysProCysValLysLeuThr 
6541 ATGCAGGAGGATATAATCAGTTTATGGGATCAAAGCCTAAAGCCATGTGTAAAATTAACC 
TACGTCCTCCTATATTAGTCAAATACCCTAGTTTCGGATTTCGGTACACATTTTAATTGG 

6567 binl, 

ProLeuCysValThrLeuAsnCysThrAspLeuGlyLysAlaThrAsnThrAsnSerSer 
6601 CCACTCTGTGTTACTTTAAATTGCACTGATTTGGGGAAGGCTACTAATACCAATAGTAGT 
GGTGAGACACAATGAAATTTAACGTGACTAAACCCCTTCCGAT.GATTATGGTTATCATCA 

6615 ahalll, 

AsnTrpLysGluGluI leLysGlyGluI leLysAsnCysSerPheAsnl leThrThrSer 
6661 AATTGGAAAGAAGAAATAAAAGGAGAAATAAAAAACTGCTCTTTCAATATCACCACAAGC 
TTAACCTTTCTTCTTTATTTTCCTCTTTATTTTTTGACGAGAAAGTTATAGTGGTGTTCG 

6670 mboll, 

IleArgAspLysIleGlnLysGluAsnAlaLeuPheArgAsnLeuAspValValProIle 
6721 ATAAGAGATAAGATTCAGAAAGAAAATGCACTTTTTCGTAACCTTGATGTAGTACCAATA 
TATTCTCTATTCTAAGTCTTTCTTTTACGTGAAAAAGCATTGGAACTACATCATGGTTAT 

AspAsnAlaSerThrThrThrAsnTyrThrAsnTyrArgLeuI leHisCysAsnArgSer 

CTATTACGATCATGATGATGGTTGATATGGTTGATATCCAACTATGTAACATTGTCTAGT 
6790 seal. 

Val I leThrGlnAlaCysProLysValSerPheGluProI leProI leHisTvrCvsThr 
6841 aSffif^^^CTGTCCAAAGGTATCATT^ 

CAGTAATGTGTCCGGACAGGTTTCCATAGTAAACTCGGTTAAGGGTATGTAATAACATGG 

6851 still. 

ProAlaGlyPheAlal leLeuLysCysAsnAsnLysThrPheAsnGlyLysGlvProfvs 
6901 CCGGCTGGTTTTGCGATTCTAAAGTGTAATAATAAAACGna 

GGCCGACCAAAACGCTAAGATTTCACATTATTATTTTGCAAGTTACCTTTTCCTGGTACA 

JhrAsnValSerThrValGlnCysThrHisGlyl leArgProI leValSerThrGlnLpu 
6961 ACAAATGTCAGCACAGTACAATG^ 

TGTTTACAGTCGTGTCATGTTACATGTGTACCTTAATCCGGTTATCACAGTTGAGTTGAC 
LeuLeuAsnGlySerLeuAlaGluGluGluValVal I leArqSerAsDAsnPhpThrAsn 

GACAATTTACCGTCAGATCGTCTTCTTCTCCATCATTAATCTAGACTGTTAAAGTGCTTG 
7042 mboll, 7045 mboll. 7060 bglll, 

FIG. 4L 
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AsnAlaLysThrl lei leValGlnLeuAsnGluSerValAlal leAsnCysThrArgPro 
7081 AATGCTAAAACCATAATAGTACAGCTGAATGAATCTGTAGCAATTAACTGTACAAGAC'CC 
TTACGATTTTGGTATTATCATGTCGACTTACTTAGACATCGTTAATTGACATGTTCTGGG 

7102 pvull, 

, AsnAsnAsnThrArgLysSerlleTyrlleGlyProGlyArgAlaPheHisThrThrGly 
7141 AACAACAATACAAGAAAAAGTATCTATATAGGACCAGGGAGfc 

TTGTTGTTATGTTCTTTTTCATAGATATATCCTGGTCCCTCTCGTAAAGTATGTTGTCCT 

7199 mboll, 

A |:?UeIleGlyAspIleArgLysAlaHisCysAsnI leSerArgAlaGlnTrpAsnAsn 
7201 AGAATAATAGGAGATATAAGAAAAGCACATTGTAACATTAGTAGAGCACAATGGAATAAC 
TCTTATTATCCTCTATATTCTTTTCGTGTAACATTGTAATCATCTCGTGTTACCTTATTG 

ThrLeuGluGlnl leValLysLysLeuArgGluGlnPheGlyAsnAsnLvsThrl leVal 
7261 ACTTTAGAACAGATAGTTAAAAAATTAAGAGAACAGTTTGGG^ 

TGAAATCTTGTCTATCAATTTTTTAATTCTCTTGTCAAACCCTTATTATTTTGTTATCAG 

PheAsnGlnSerSerGlyGlyAspProGluI leValMetHisSerPheAsnCvsAraGlv 
7321 TTTAATCAATCCTCAGGAGGGGACCCAGAAATTGTAATG^ 

AAATTAGTTAGGAGTCCTCCCCTGGGTCTTTAACATTACGTGTCAAAATTAACATCTCCC 

7331 mstll, 
770 . GluPhePheTyrCysAsnThrThrGln 

7381 GAATTTTTCTACTGTAATACAACACAACTGTTTAATAATACATGM 

CTTAAAAAGATGACATTATGTTGTGTTGACAAATTATTATGTACCTCCAATTTA^ 

7621 fsGcfcCAGAAGTCTGCACcfcCTCCTCTA^ 
7628 mboll, 

7736 mboll, 

™wmmmmmmmm 

FIG. 4M 
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LeuLeuSerGlyl levalGlnGlnGlnAsnAsnLeuLeuArgAlal leGluAlaGlnGln 
7861 TTATTGTCTGGTATAGTGCAACAGCAGAACAATTTGCTGAGGGCTATTGAGGCGCAACAA 
AATAACAGACCATATCACGTTGTCGTCTTGTTAAACGACTCCCGATAACTCCGCGTTGTT 

HisLeuLeuGlnLeuThrValTrpGlyl leLysGlnLeuGlnAlaArgValLeuAlaVal 
7921 CATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAGGCAAGAGTCCTGGCTGTG 
GTAGACAACGTTGAGTGTCAGACCCCGTAGTTCGTCGAGGTCCGTTCTCAGGACCGACAC 

GluArgTyrLeuArgAspGlnGlnLeuLeuGlyl leTrpGlyCysSerGlyLysLeuI le 
7981 GAAAGATACCTAAGGGATCAACAGCTCCTAGGGATTTGGGGTTGCTCTGGAAAACTCATT 
CTTTCTATGGATTCCCTAGTTGTCGAGGATCCCTAAACCCCAACGAGACCTTTTGAGTAA 

7989 mstll, 7995 binl, 8007 avr2, 

CysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluAspI leTrp 
8041 TGCACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAAGACATTTGG 
ACGTGGTGACGACACGGAACCTTACGATCAACCTCATTATTTAGAGACCTTCTGTAAACC 

8089 mboll, 

AspAsnMetThrTrpMetGlnTrpGluArgGluI leAspAsnTyrThrAsnThrl leTyr 
8101 GATAACATGACCTGGATGCAGTGGGAAAGAGAAATTGACAATTACACAAACACAATATAC 
CTATTGTACTGGACCTACGTCACCCTTTCTCTTTAACTGTTAATGTGTTTGTGTTATATG 

ThrLeuLeuGluGluSerGlnAsnGlnGlnGluLysAsnGluGlnGluLeuLeuGluLeu 
8161 ACCTTACTTGAAGAATCGCAGAACCAACAAGAAAAGAATGAACAAGAATTATTAGAATTG 
TGGAATGAACTTCTTAGCGTCTTGGTTGTTCTTTTCTTACTTGTTCTTAATAATCTTAAC 

8170 mboll, 

AspLysTrpAlaSerLeuTrpAsnTrpPheSerl leThrAsnTrpLeuTrpTyr I leLys 
8221 GATAAGTGGGCAAGTTTGTGGAATTGGTTTAGCATAACAAACTGGCTGTGGTATATAAAG 
CTATTCACCCGTTCAAACACCTTAACCAAATCGTATTGTTTGACCGACACCATATATTTC 

ooo, U?PhejleMetIleValGlyGlyLeuValGlyLeuArgIleValPheAlaValLeuSer 
8281 ATATTCATAATGATAGTAGGAGGCTTGGTAGGTTTAAGAATAGTTTTTGC 

TATAAGTATTACTATCATCCTCCGAACCATCCAAATTCTTATCAAAAACGACACGAAAGA 

o,, , !I? v alAsnArgValArgGlnGIyTyrSerProLeuSerPheGlnThrArgLeuProVal 
8341 ei^IGAATAGjGTTAGSCAGGGATACTCACCATTGTCAm^ 

TATCACTTATCTCAATCCGTCCCTATGAGTGGTAACAGTAAAGTCTGGGCGGAGGGTCAG 

8400 aval, 

„, nt ProArgGlyProAspArgProAspGlyl leGluGluGluGlyGlyGluArgAspArgAsp 
8401 CCGAGGGGACCCGACAGGCCCGACGGAATCGAAGAAG 

GGCTCCCCTGGGCTGTCCGGGCTGCCTTAGCTTCTTCTTCCACCTCTCTCTCTGTCTCTG 

8431 mboll, 8434 mboll, 

o, C1 ^SerValAraLeuValAspGlyPheLeuAlaLeuI leTrpGluAspLeuArgSerLeu 
8^61 AGATCCGTTCGATTAGTGGAT^ 

TCTAGGCAAGCTAATCACCTACCTAAGAATCGTGAATAGACCCTTCTAGACGCCTCGGAC 
8503 mboll, 8505 bglll, 

8521 Irrrlr IlrTrPlT^^^^IT^^^^^^T^^TCTTGATTGCAGCGAGGACTGTGGAA 
ACGGAGAAGTCGATGGCGGCGAACTCTCTGAATGAGAACTAACGTCGCTCCTGACACCTT 

8525 mboll. ■ 
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I leLeuGlyHlsArgGlyTrpGluAlaLeuLysTyrTrpTrpSerLeuLeuGlriTyrTrp • 
8581 ATTCTGGGGCACAGGGGGTGGGAAGCCCTCAAATATTGGTGGAGTCTCCTGCAGTATTGG : 
TAAGACCCCGTGTCCCCCACCCTTCGGGAGTTTATAACCACCTCAGAGGACGTCA'TAACC: 

8629 pstl, 

IleGlnGluLeuLysAsnSerAlaValSerTrpLeuAsnAlaThrAlalleAlaValThr 
8641 ATTCAGGAACTAAAGAATAGTGCTGTTAGCTGGCTCAACGCCACAGCTATAGCAGTAACT 
TAAGTCCTTGATTTCTTATCACGACAATCGACCGAGTTGCGGTGTCGATATCGTCATTGA 

GluGlyThrAspArgVal I leGluValAlaGlnArgAlaTyrArgAlal leLeuHisI le 
8701 GAGGGGACAGATAGGGTTATAGAAGTAGCACAAAGAGCTTATAGAGCTATTCTCCACATA 
CTCCCCTGTCTATCCCAATATCTTCATCGTGTTTCTCGAATATCTCGATAAGAGGTGTAT 

HisArgArgl leArgGlnGlyLeuGluArgleuLeuLeuOC MetGlyGlyLysTrpSer 
8761 CATAGAAGAATTAGACAGGGCTTGGAAAGGCTTTTGCTATAAGATGGGTGGCAAGTGGTCA 
GTATCTTCTTAATCTGTCCCGAACCTTTCCGAAAACGATATTCTACCCACCGTTCACCAGT 



8765 mboll, 

o„™ ^?£££§£C M fJ§ lyG1 y Tr P SerAlaI leArgGluArgMetArgArgAlaGluProArg 
8822 AAACGTAGTATGGGTGGATGGTCTGCTATAAGGGAAAGAATGAGACGAGCTGAGCCACGA 
TTTGCATCATACCCACCTACCAGACGATATTCCCTTTCTTACTCTGCTCGACTCGGTGCT 

nnnn AlaGluProAlaAlaAspGlyValGlyAlaValSerArgAspLeuGluLysHisGlyAla 
8882 GCTGAGCCAGCAGCAGATGGGG 

CGACTCGGTCGTCGTCTACCCCACCCTCGTCATAGAGCTCTGGACCTTTTTGTACCTCGT 
8883 tthllll, 8916 aval xhol, 

on,o U^ThrSerSerAsnThrAlaAlaThrAsnAlaAspCysAlaTrpLeuGluAl^ 
89£ » 2 Hf^^GTAGCAATACAGCAGa 

TAGTGTTCATCGTTATGTCGTCGATGATTACGACTAACACGGACCGATCTTCGTGTTCTC 

nnnn GluGluGluValGlyPheProValArgProGlnValProLeuArgProMetThrTvrLvs 
9002 GAGGAAGAGGTGGGTTTTCCAG^ 

CTCCTTCTCCACCCAAAAGGTCAGTCTGGAGTCCATGGAAATTCTGGTTACTGAATGTTC 
9005 mboll, 9029 mstll, 903J_kpnl, 

9062 

CGTCGAAATCTATAATCGGTGAAAAATTTTCTTTTCCCCCCTGACCTTCCCGATTAAACC 
9085 ahalll, 

SerGlnArgArgGlnGluI leLeuAspLeuTrpI leTyrHisThrGlnGlvTvrPhpPrn 
AG6GTTTCTT CTGTT CTCTAGGAACTAGACA^ 
9129 mboll, 9153 binl, 

^6lC6 GlnAsnT yrThrProGlyProGlyIleArgTyrProLeuThrPheGlvTrDrv<; 

9182 ^mmunm^^i^^nM 

9210 binl. 9216 ecor5, 

FIG. 40 



PheLysLeuValProValGluProGlulysValGluGluAlaAsnGluGlyGluAsnAsn 
9242 TTCAAGCTAGTACCAGTTGAGCCAGAGAAGGTAGAAGAGGCCAATGAAGGAGAGAACAAC 
AAGTTCGATCATGGTCAACTCGGTCTCTTCCATCTTCTCCGGTTACTTCCTCTCTTGTTG 

9275 rriboll. 

SerLeuLeuHisProMetSerLeuHisGlyMetGluAspAlaGluLysGluValLeuVal 
9302 AGCTTGTTACACCCTATGAGCCTGCATGGGATGGAGGACGCGGAGA^ 

TCGAACAATGTGGGATACTCGGACGTACCCTACCTCCTGCGCCTCTTTCTTCACAATCAC 

n . M TrpArgPheAspSerLysLeuAl^ 

9362 TGGAGGTTTGACAGCAAACTAGCATTTCATCACATGGCCCGAGAGCTGCATCCGGAGTAC 
ACCTCCAAACTGTCGTTTGATCGTAAAGTAGTGTACCGGGCTCTCGACGTAGGCCTCATG 

9399 aval, 9417 seal, 
TyrLysAspCysOP 

9422 TACAAAGACTGCTGACATCGAGCTTTCTACAAGGGACTTTCCGCTGGGGACTTTCCAGGG 
ATGTTTCTGACGACTGTAGCTCGAAAGATGTTCCCTGAAAGGCGACCCCTGAAAGGTCCC 



9482 TrrrrIr9r9I99^^^^^^^^^^^^^^^CGTCCCTCAGATGCTGCATATAAGCAGCTG 
TCCGCACCGGACCCGCCCTGACCCCTCACCGCAGGGAGTCTACGACGTATATTCGTCGAC 

9536 pvull, 

951,2 ™^m^a?M^iam^aaa?g 

9576 bglll, 9590 sacl, 
9620 af 111, 9634 hindlll, 

9722 TGGAAAAATCTCTAGCAG 

ACCTTTTTAGAGATCGTC „ 

FIG. 4P 
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U3-»* 

-453 CT6GAAG6GCTAATTTGGTCCCAAAGAAGACAAGAGATCCTTGATCTGTGGATCTACCAC , 
ACACAAGGCTACTTCCCTGATTGGCAGAATTACACACCAGGGCCAGGGATCAGATATCCA 

-333 CTGACCTTTGGATGGTGCTTCAAGCTAGTACCAGTTGAGCCAGAGAAGGTAGAAGAGGCC 

AATGAAGGAGAGAACAACAGCTTGTTACACCCTATGAGCCTGCATGGGATGGAGGACGCG L 

-214 GAGAAAGAAGTGTTAGTGTGGAGGTTTGACAGCAAACTAGCATTTCATCACATGGCCCGA 

GAGCTGCATCCGGAGTACTACAAAGACTGCTGACATCGAGCTTTCTACAAGGGACTTTCCG J 
-93 CTGGGGACTTTCCAGGGAGGCGTGGCCTGGGCGGGACTGGGGAGTGGCGTCCCTCAGATG 

CTGCA TATAA GCAGCTGCTTTTTGCCTGTACTG GGTCTCTGTGGTTAGACCAGATCTGAG R 
28 CCTGGGAGCTCTCTGGCTAACTAGGGAACCCACTGCTTAAGCCTCAATAAAGCTTGCCTT 
R U5 

GAGTGCTTCA AGTAGTGTGTGCCCGTCTGTTGTGTGACTCTGGTAACTAGAGATCCCTCA 

--U5 

148 GACCCTTTTAGTCAGTGTGGAAAAATCTCTAGCAG TGGCGCCCGAACAGGGACGCGAAA 
GCGAAAGTAGAACCAGAGGAGCTCTCTCGACGCAGGACTCGGCTTGCTGAAGCGCGCACAG 

268 CAAGAGGCGAGGGGCGGCGACTGGTGAGTACGCCAATTTTTGACTAGCGGAGGCTAGAAG 

MetGlyAlaArgAlaSerValLeuSerGlyGlyGluLeuAspLysTrpGlu 17 
GAGAGAGAGATGGGTGCGAGAGCGTCGGTATTAAGCGGGGGAGAATTAGATAAATGGGAA 

Lysl leArgLeuArgProGlyGlyLysLysLysTyrLysLeuLysHisI leValTrDAla 
388 AAAATTCGGTTAAGGCCAGGGGGAAAGAAAAAATATAAGTTAAAA^ 



57 



508 



628 
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Ser ArgGl uLeuG luArgPheA lava 1 AsnProG 1 yLeuLeuG 1 uThrSerG 1 uG 1 yCys 
AGCAGGGAGCTAGAACGATTCGCAGTCAATCCTGGCCTGTTAGAAACATCAGAAGGC^ 

Ar^i^U^^Jly^iPU^lP ProSerLeuGlnThr GlySerGluGluLeuArgSer 
AGACAAATATTGGGACAGCTACAGCCATCCCTTCAGACAGGATCAGAAGAACTTAGATCA 

LeuTy r AsnThrVa 1 A laThrLeuTy rCysVa lHlsGlnArgl leAsDVa 1 LvsAsDThr 97 
TTATATAATACAGTAGCAACCCTCTATTGTGTACATCAAAGGATAGATGTAA 

LysGluAlaLeuGluLysI leGluGluGluGlnAsnLysSerLysLvsLvsAlaGlnGln 
AAGGAAGCTTTAGAGAAGATAGAGGAAGAGCAAAACAAAA^ 

AlaAlaAlaAlaAlaGlyThrGlyAsnSerSerGlnValSerGlnAsnTyrProIleVal 137 
GCAGCAGCTGCAGCTGGCACAGGAAACAGCAGCCAGGTCAGCCAAAATTACCCTATAGTG 

CAGAACCTACAGGGGCAAATGGTACATCAGGCCATATCACCTAGAACTTTAAATGCATGG 
ValLysyaiyalGluGluLysAlaPheSerProGluVal I leProMetPheSerAlaLeu 177 

gtaaaagtagtagaagaaaaggctttcagcccagaagtaatacccatgWttcagcatt 

§?C5iySlyAlaThrProGlnAspLeuAsnThrMetLeuAsnThrValGlyGlvHisGln 
868 TCAGAAGGAGCCACCCCACAAGATTTAAACACCATGCTAAACACAGTGGGGGGACA^ 

FIG. 5A 
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A 1 a A 1 aMetG 1 nMetLeuLy sG luThr 1 1 eA snG 1 uG 1 uA 1 aA 1 aG 1 uTrpAspA rgVa 1 217 G 
GCAGCCATGCAAATGTTAAAAGAGACTATCAATGAGGAAGCTGCAGAATGGGATAGAGTG . 

HisProValHisAlaGlyProI leAlaProGlyGlnMetArgGluProArgGlySerAsp 
988 CATCCAGTGCATGCAGGGCCTATTGCACCAGGCCAAATGAGAGAACCAAGGGGAAGTGAC A 

UeAlaGlyThrThrSerThrLeuGlnGluGlnlleGlyTrpMetThrA 257 
ATAGCAGGAACTACTAGTACCCTTCAGGAACAAATAGGATGGATGACAAATAATCCACCT 

Q 

I leProValGlyGluI leTyrLysArgTrpI lei leLeuGlyLeuAsnLysI leValArg 
1108 ATCCCAGTAGGAGAAATCTATAAAAGATGGATAATCCTGGGATTAAATAAAATAGTAAGA 

MetTyrSerProThrSerl leLeuAspI leArgGlnGlyProLysGluProPheArgAsp 297 
ATGTATAGCCCTACCAGCATTCTGGACATAAGACAAGGACCAAAGGAACCCTTTAGAGAT 

1000 TyrValAspArgPheTyrLysThrLeuArgAlaGluGlnAlaSerGlnAspValLysAsn 
1228 TATGTAGACCGGTTCTATAAAACTCTAAGAGCCGAACAAGCTTCACAGGATGTAAAAAAT 

IfR^fJfTDr5]yTD rLeuLeuValGlnAsn AlaAsnProAspCysLysThrIleLeuLys 337 
TGGATGACAGAAACCTTGTTGGTCCAAAATGCAAACCCAGATTGTAAGACTATTTTAAAA 



1348 



A 1 aLeuG 1 y ProA 1 aA 1 aTh rLeuG 1 uG 1 uMe tMetTh r A 1 aCysG InGlyValGlyGly 
GCATTGGGACCAGCAGCTACACTAGAAGAAATGATGACAGCATGTCAGGGAGTGGGGGGA 

PrpGlyHisLysAlaArgValLeuAlaGluAlaMetSerGlnValThrAsnProAlaAsn 377 
CCCGGCCATAAAGCAAGAGTTTTGGCTGAAGCCATGAGCCAAGTAACAAATCCAGCTAAC 

I leMetMetGlnArgGlyAsnPheArgAsnGlnArgLysThrValLysCysPheAsnCvs 
ATAATGATGCAGAGAGGCAATTTTAGGAACCAAAGAAAGACTGTTAAGTGTTK 

^AXW^§^]y5 lyH i s I leAlaLysAsnCysArgAlaProArgLysLysGlyCysTrpAra 417 
GGCAAAGAAGGGCACATAGCCAAAAATTGCAGGGCCCCTAGGAAAA^ 1 

CysGlyArgGluGlyHisGlnMetLysAspCysThrGluArgGlnAlaAsnPheLeuGly 

1588 TGTGGAAGGGAAGGACACCAAATGAAAGATTGCACTGAGAGACAGGCTAATTTTTTAGGG 

V nf iif L r .§f LgSirly.^ V y ^§ 1 y A r 9? r 9 G 1 snPheLeuG I n Se r A rgP roG 1 uP ro 457 
iy5§St§ uAlaP,i eLeuGlnGlyLysAlaArgGluPheSerSerGl iiRinThrflraAia o\ 

AAGATCTGGCCTTCCTACAAGGGAAGGCCAGGGAATTTTCTK 23 

■ 9iP9J* i 5 ""? 1 1 ? As R'r y sGluLeuTyrProLeuThrSerLeuArgSerLeuPheGlvAsn 4Q7 

CAGGAGCCGATAGACAAGGAACTGTATCCTTTAACTTCCCTCAGATCACTCTTTGGCAAC 
AspProSerSerGlnOC 

FIG. 5B 
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lyHisLysAlalleGlyThrValLeuValGlyProThrProValAsnl lelleGlyArg 143 
GACATAAAGCTATAGGTACAGTATTAGTAGGACCTACACCTGTCAACATAATTGGAAGAA 

AsnLeuLeuThrGlnl leGlyCysThrLeuAsnPheProI leSerProI leGluThrVal'P 
2068 ATCTGTTGACTCAGATTGGTTGTACTTTAAATTTCCCCATTAGTCCTATTGAAACTGTAC 

ToValLysLeuLysProGlyMetAspGlyProLysValLysGlnTrpProLeuThrGlu '183 
CAGTAAAATTAAAGCCAGGAATGGATGGCCCAAAAGTTAAGCAATGGCCATTGACAGAAG 

GluLysIleLysAlaLeuValGluI leCysThrGluMetGluLysGluGlyLysI leSerL 
2188 .AAAAAATAAAAGCATTAGTAGAGATATGTACAGAAATGGAAAAGGAAGGGAAAATTTCAA 

ysl leGlyProGluAsnProTyrAsnThrProValPheAlal leLysLysLysAspSer 223 
AAATTGGGCCTGAAAATCCATACAATACTCCAGTATTTGCTATAAAGAAAAAAGACAGTA 

ThrLysTrpArgLysLeuvalAspPheArgGluLeuAsnLysArgThrGlnAspPheTrpG 
2308 CTAAATGGAGAAAACTAGTAGATTTCAGAGAACTTAATAAAAGAACTCAAGACTTCTGGG 

luValGlnLeuGlylleProHisProAlaGlyLeuLysLysLysLysSerValThrVal 263 
AAGTTCAGTTAGGAATACCACACCCCGCAGGGTTAAAAAAGAAAAAATCAGTAACAGTAT 

LeuAspvalGlyAspAlaTyrPheSerValProLeuAspLysAspPheArgLysTyrThrA 
2428 TGGATGTGGGTGATGCATACTTTTCAGTTCCCTTAGATAAAGACTTTAGAAAGTATACTG 

laPheThrl leProSerl leAsnAsnGluThrProGlyl leArgTyrGlnTyrAsnVal 303 
CATTTACCATACCTAGTATAAACAATGAGACACCAGGGATTAGATATCAGTACAATGTGC 

, C1 Q LeuProG 1 nG 1 yTrpLysG 1 ySerProA la I lePheG 1 nSerSerMetThrLys I leLeuG 
2548 TGCCACAGGGATGGAAAGGATCACCAGCAATATTCCAAAGTAGCATGACAAAAATCTTAG 

JuProPheArgLysGlnAsnProAspI leVal I leTyrGlnTyrMetAspAspLeuTyr 343 
AGCCTTTTAGAAAACAGAATCCAGACATAGTTATCTATCAATACATGGATGATTTGTAT^ 

ozz Q y r\9tt&tt$ P^?y£iyU?£,U$!pyi sArgThrLys IleGluGl uLeuArgG InH i sL 
2668 TAGGATCTGACTTAGAAATAGGGCAGCATAGAACAAAAATAGAGGAACTGAGACAGCATC 

euLeuArgTrpGlyPheThrThrProAspLysLysHisGlnLysGluProProPheieu 383 
TGTTGAGGTGGGGATTTACCACACCAGACAAAAAACATCAGAAAGAACC^ 

o^ 0 o T fRH l ?^i^Iy rGlul -euHisProAspLysTrpThrValGlnProI leMetLeuPrnm m 
2788 GGATGGGTTATGAACTCCATCCTGATAAATGGACAGTACAGCCTATAAT 

euThrGluVallleProLeuThrGluGluAlaGIuLeuGluLeuAlaGluAsnarafilii uf.* P 
TAACAGAAGTAATACCACTAACAGAAGAAGCAGAGCTAGAAXTGGCAGAAM K 

hrGluAlaValGlnLysValSerThrGluSerl leVal I leTrpGlyLvsl lePrnLvs iui L 
CAGAGGCAGTGCAAAAAGTATCCACAGAAAGCATAGTAATATGGGGAAAG^ 

FIG. 5C 
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w PheLysLeuProI leGlnLysGluThrTrpGluAlaTrpTrpMetGluTyrTrpGlriAlaT 
3268 TTAAACTACCCATACAAAAGGAAACATGGGAAGCATGGTGGATGGAGTATTGGCAAGCTA 

hrTrp 1 1 eProG 1 uTrpG 1 uPheVa 1 AsnThrProProLeuVa 1 LysLeuTrpTy rG 1 n 583 
CCTGGATTCCTGAGTGGGAGTTTGTCAATACCCCTCCCTTAGTGAAATTATGGTACCAGT 

3388 TAGAGAAAGAACCCATAGTAGGAGCAGAAACTTTCTATGTAGATGGGGCAGCTAATAGGG 

jl^I^lrYftey^AylrysAIaGlyTyr^^ 623 
AGACTAAATTAGGAAAAGCAGGATATGTTACTGACAGAGGAAGACAAAAAGTTGTCTCCA 

A^Xe^fiTX^¥^ai£g^fi?c r Sg^^gkXlg^Aif^Xg^Sl9§A^AS^?AG 703 

. AspLysLeuvalSerAlaGlylleArgLysvalLeuPheLeuAsnGlyl leAspLvsAlaG 
3748 ATAAATTAGTCAGTGCTGGAATCAGGAAAGTACTATTTTTGAATGGAATAGA^ 

i^I^lyyi^lylr^I^His^rAsnTrpArqAlaMetAlaSerAspPheAsnLeu 743 
AAGAAGAACATGAGAAATATCACAGTAATTGGAGAGCAATGGCTAGTGATTTTAACCTGC 

lf?£8 G A ^gA^A r a^gASgAA u A2A r ^gTXf?9?bf^f^ L XA s Vf^^aaS£^gc 823 
^^eT T £g^g^A a a^Uc% L XagA8G G AA u ¥Vfa^Aifc p c r ^X£SA s T n ^8eAA n fG^8^G 863 

AAACT AiiAGAACTACAAAAGCAAAT^ 

FIG.5D 
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AspAsnLysAspProLeuTrpLysGlyProA laLysLeuLeuTrpLysGlyGluGlyAlaV '''a-j 
4468 ACAACAAAGATCCCCTTTGGAAAGGACCAGCAAAGCTTCTCTGGAAAGGTGAAGGGGCAG... . 

alVal I leGlnAspAsnSerAspI leLysValValProArgArgLysAlaLysi leii le':983' 
TAGTAATACAAGATAATAGTGACATAAAAGTAGTGCCAAGAAGAAAAGCAAAAATCATTA > ..< 

ArgAspTyrGlyLysGlnMetAlaGlyAspAspCysValAlaSerArgGlnAspGluAspA - 
4588 GGGATTATGGAAAACAGATGGCAGGTGATGATTGTGTGGCAAGTAGACAGGATGAGGATT 

M 

AGAACATGGAAAAGTTTAGTAAAACACCATATGTATATTTCAAAGAAAGCTAAAGGATGG 
4708 TTTTATAGACATCACTATGAAAGTACTCATCCAAGAGTAAGTTCAGAAGTACACATCCCC 

CTAGGGGATGCTAAATTGGTAATAACAACATATTGGGGTCTGGATACAGGAGAAAGAGAA 
4828 TGGCATTTGGGCCAGGGAGTCGCCATAGAATGGAGGAAAAAGAAATATAGCACACAAGTA 

GACCCTGGCCTAGCAGACCAACTAATTCATCTGCATTATTTTGATTGTTTTTCAGAATCT 
4948 GCTATAAAAAATGCCATATTAGGATATAGAGTTAGTCCTAGGTGTGAATATCAAGCAGGA 

CATAACAAGGTAGGATCTCTACAATACTTGGCACTAGCAGCATTAATAACACCAAAAAAG 
5068 ACAAAGCCACCTTTGCCTAGTGTTAAGAAACTGACAGAGGATAGATGGAACAAGCCCCAG 

AAGACCAAGGGCCACAGAGGGAGCCATACAATGAATGGACACTAGAGCTTTTAGAGGAGC 
5188 TTAAGAGAGAAGCTGTTAGACATTTTCCTAGGCCATGGCTCCATAGCTTAGGACAATATA 

TCTATGAAACTTATGGGGATACTTGGGCAGGAGTGGAAGCCATAATAAGAATTCTGCAAC 
5308 AACTGCTGTTTATTCATTTCAGAATTGGGTGTCAACATAGCAGAATAGGCATTATTCAAC 

AGAGGAGAGCAAGAAGAAATGGAGCCAGTAGATCCTAATCTAGAGCCCTGGAAGCATCCA 
5428 GGAAGTCAGCCTAGGACTGCTTGTAACAATTGCTATTGTAAAAAGTGTTGCTTTCATTGC 

TACGCGTGTTTCACAAGAAAAGGCTTAGGCATCTCCTATGGCAGGAAGAAGCGGAGACAG 
5548 CGACGAAGAGCTCCTCAGGACAGTCAGACTCATCAAGCTTCTCTATCAAAGCAGTAAGTA 

GTAAATGTAATGCAATCTTTACAAATATTAGCAATAGTATCATTAGTAGTAGTAGCAATA 

5668 ATAGCAATAGTTGTGTGGACCATAGTACTCATAGAATATAGGAAAATATTAAGACAAAGA 

AAATAGACAGATTAATTGATAGAATAAGAGAAAAAGCAGAAGACAGTGGCAATGAAAGTG 3 

. nnn IrpLysGluAlaThrThrThrLeuPheCysAlaSerAspAlaArgAlaTyrAsDThrGlu 
5908 TGGAAAGAAGCAACTACCACTCTATTTTGTGCATCAGATGCTAGAGCATAT^ 

ValHisAsnValTrpAlaThrHlsAlaCysValProThrAspProAsnProGlnGluVal 83 
GTACATAATGTTTGGGCCACACATGCCTGTGTACCCACAGACCCC^ 

FIG. 5E 
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cr,oo y?il:?JJGlyAsnValThrGluAsnPheAsnMetTrpLysAsnAsnMetValGluGlnMet 
6028 GTATTGGGAAATGTGACAGAAAATTTTAACATGTGGAAAAATAACAT 

9lPSiy£§2Uf I leSerLeuTrpAspGlnSerLeuLysProCysValLysLeuThrPro 123 
CAGGAGGATATAATCAGTTTATGGGATCAAAGCCTAAAGCCATGTGTAAAATTAACCC 

n LeuCysValThrLeuAsnCysThrAspLeuGlyLysAlaThrAsnThrAsnSersprA<;n 
6148 CTCTGTGTTACTTTAAATTGCACTGATTTGGGGAAGGCTACTAATACCAAT^ 

IfRUf^i^iyn?!r^ G1 y GluIIeL ysAsnCysSerPheAsnIleThrThrSerI le 163 
TGGAAAGAAGAAATAAAAGGAGAAATAAAAAACTGCTCTTTCAATATCACCACAAGCAT 

6268 AGA^GATAAGATTCAGA^ 

A^C?^^^ 203 

r-,™ \ If T h rG 1 n A * a C y s P ro L y s Va 1 S e r PheG 1 u P ro I leProI leHi sTvrTvQThrPm 
6388 ATTACACAGGCCTGTCCAAAGGTATCATTTGAGCCAATTCC^ 

^f^^?Vf^l^^?bf^xg^sfgA s ¥S!?A L xI^^T p V5Sl?§GXA L xla^™fA T ^ ^2l,3 E 

6508a\«[»^ 

67*18 ^TAATAGGAGA?ATAAGa'aa'aGCACAT^ 

6868 ^A^cl!^^^ " 

FIG. 5F 
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TyrLysVall leLysI leGluProLeuGlyl leAlaProThrLysAlaLysArgArgVal 
7228 TATAAAGTAATAAAAATTGAACCATTAGGAATAGCACCCACCAAGGCAAAGAGAAGAGTG , ,.: <; 

ValGlnArgGluLysArgAlaValGlylleValGlyAlaMetPheLeuGlyPheLeuGly 523"" 
GTGCAGAGAGAAAAAAGAGCAGTGGGAATAGTAGGAGCTATGTTCCTTGGGTTCTTGGGA 

AlaAlaGlySerThrMetGlyAlaValSerLeuThrLeuThrValGlnAlaArgGlnLeu 
7318 GCAGCAGGAAGCACTATGGGCGCAGTGTCATTGACGCTGACGGTACAGGCCAGACAATTA 

LeuSerGlylleValGlnGlnGlnAsnAsnLeuLeuAr 563 
TTGTCTGGTATAGTGCAACAGCAGAACAATTTGCTGAGGGCTATTGAGGCGCAACAACAT 

_ ii „ LeuLeuG 1 nLeuTh rVa 1 TrpG 1 y 1 1 eLy sG 1 nLeuG 1 nA 1 aA rgVa 1 LeuA 1 aVa 1 G 1 u 
7168 CTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAGGCAAGAGTCCTGGCTGTGGAA 

^ir§Ty rLeuA rgAspG 1 nG 1 nLeuLeuG 1 y I leTrpGlyCysSerGlyLysLeuI leCys 603 
AGATACCTAAGGGATCAACAGCTCCTAGGGATTTGGGGTTGCTCTGGAAAACTCATTTGC 

TDrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluAspI leTrpAsp 
7588 ACCACTGCTGTGCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAAGACATTTGGGAT 

A ?P^MI^ etGlnTr "P GluAr 9 GluIleA spAsnTyrThrAsnThrIleTyrThr 613 
AACATGACCTGGATGCAGTGGGAAAGAGAAATTGACAATTACACAAACACAATATACACC 

77no LeuLeuG 1 uG 1 uSerG 1 n AsnG 1 nG 1 nG 1 uLysAsnG 1 uG 1 nG 1 uLeuLeuG 1 uLeuAsp 
7708 TTACTTGAAGAATCGCAGAACCAACAAGAAAAGAATGAACAAGAATTATTAGAATTGGAT 

LysTrpAlaSerLeuTrpAsnTrpPheSerl leThrAsnTrpLeuTrpTyrl leLysI le 683 
AAGTGGGCAAGTTTGTGGAATTGGTTTAGCATAACAAACTGGCTGTGGTATATAAAGATA 

7828 wMattisiiiaiiaiwo^mif^iim^fMwx E 

8308 ^^^H«G r ^iG n |^KG U G S i«^HVT L f^fA U ?A C AGATGGGTGGCAAGTGGTCAAA 
ACGTAGTATGGGTGGATGGTCTGCTATAAGGGAAAGAATGAGACGAGCTGAGCCACGAGC 
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8428 TGAGCCAGCAGCAGATGGGGTGGGAGCAGTATCTCGAGACCTGGAAAAACATGGAGCAAT 
CACAAGTAGCAATACAGCAGCTACTAATGCTGATTGTGCCTGGCTAGAAGCACAAGAGGA 
8548 GGAAGAGGTGGGTTTTCCAGTCAGACCTCAGGTACCTTTAAGACCAATGACTTACAAGGC 

U3-*- 

. AGCTTTAGATATTAGCCACTTTTTAAAAGAAAAGGGGGGA CTGGAAGGGCTAATTTGGT [_ 
8667 CCCAAAGAAGACAAGAGATCCTTGATCTGTGGATCTACCACACACAAGGCTACTTCCCTG 

ATTGGCAGAATTACACACCAGGGCCAGGGATCAGATATCCACTGACCTTTGGATGGTGCT ^ 
8787 TCAAGCTAGTACCAGTTGAGCCAGAGAAGGTAGAAGAGGCCAATGAAGGAGAGAACAACA 

GCTTGTTACACCCTATGAGCCTGCATGGGATGGAGGACGCGGAGAAAGAAGTGTTAGTGT 
8907 GGAGGTTTGACAGCAAACTAGCATTTCATCACATGGCCCGAGAGCTGCATCCGGAGTACT R 

ACAAAGACTGCTGACATCGAGCTTTCTACAAGGGACTTTCCGCTGGGGACTTTCCAGGGA 

9027 GGCGTGGCCTGGGCGGGACTGGGGAGTGGCGTCCCTCAGATGCTGCATATAAGCAGCTGC 
— -U3 R-»- 

TTTTTGCCTGTACTG GGTCTCTCTGGTTAGACCAGATCTGAGCCTGGGAGCTCTCTGGC 

9146 TAACTAGGGAACCCACTGCTTAAGCCTCMtAMGCTTGCCTTGAGTGCTTCA AGTAGT 
GTGTGCCCGTCTGTTGTGTGACTCTGGTAACTAGAGATCCCTCAGACCCTTTTAGTCAGT 

9265 GTGGAAAAATCTCTAGCAG . 
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ptac 5 Promotor 



MetlleVal 
ATGATCGTA 



GlnAsnLeuGlnGlyGlnMetValHisGlnAlalleSerProArgThrLeuAsnAlaTrp 
748 CAGAATCTGCAGGGGCAAATGGTACATCAGGCCATATCACCTAGAACTTTAAATGCATGG 

ValLysValValGluGluLysAlaPheSerProGluVal I leProMetPheSerAlaLeu 181 
GTAAAAGTAGTAGAAGAAAAGGCTTTCAGCCCAGAAGTAATACCCATGTTTTCAGCATTA 

SerGluGlyAlaThrProGlnAspLeuAsnThrMetLeuAsnThrValGlyGlyHisGln 
868 TCAGAAGGAGCCACCCCACAAGATTTAAACACCATGCTAAACACAGTGGGGGGACATCAA 

AlaAlaMetGlnMetLeuLysGluThrlleAsnGluGluAlaA^ 221 
GCAGCCATGCAAATGTTAAAAGAGACTATCAATGAGGAAGCTGCAGAATGGGATAGAGTG 

„„„ HisProValHisAlaGlyProI leAlaProGlyGlnMetArgGluProArgGlySerAsp 
988 CATCCAGTGCATG CAGGGCCTATTGCACCAGGCCAAATGAGAGAACCAAGGGGAAGTGAC 

UeAlaGlyThrThrSerThrLeuGlnGluGlnlleGlyTrpM^ 261 
ATAGCAGGAACTACTAGTACCCTTCAGGAACAAATAGGATGGATGACAAATAATCCACCT 

, ' _ I leProValGlyGluI leTyrLysArgTrpI lei leLeuGlyLeuAsnLysI leValArg 
1108 ATCCCAGTAGGAGAAATCTATAAAAGATGGATAATCCTGGGATTAAATAAAATAGTAAGA 

MetTyrSerProThrSerl leLeuAspI leArgGlnGlyProLysGluProPheArgAsp 301 
ATGTATAGCCCTACCAGCATTCTGGACATAAGACAAGGACCAAAGGAACCCTTTAGAGAT 

,o^o lYCyalAspArgPheTyrLysThrLeuArgAlaGluGlnAlaSerGlnAspValLysAsn 
1228 TATGTAGACCGGTTCTATAAAACTCTAAGAGCCGAACAAGCTTCACA^ 

ItR^JTtJC^iyThfLeuLeLiValGlnAsnAlaAsnProAspCysLysThrl leLeuLvs 341 
TGGATGACAGAAACCTTGTTGGTCCAAAATGCAAACCCAGATTGTAAGACTATTTTA^ 

k$y& y £t°M&M\! r M uG 1 uG 1 uMetMetThrA 1 aCysG InGlyValGlyGly 
GCATTGGGACCAGCAGCTACACTAGAAGAAATGATGACAGCATGTCAGGGAGTGGGGGGA 

ProGlyHisLysAIaArgValLeu Stop Stop 
CCCGGGCATAAAGCAAGAGTTTTGTGATAG M 



1348 



Ptac 5 
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ptac 5 Promotor 



Me 1 1 leva 1 141 
ATGATCGTA , 



GlnAsnLeuGlnGlyGlnMetValHisGlnAlal leSerProArgThrLeuAsnAlaTrp 
748 CAGAATCTG CAGGGGCAAATGGTACATCAGGCCATATCACCTAGAACTTTAAATGCATGG 

ValLysValValGluGluLysAlaPheSerProGluVall leProMetPheSerAlaLeu 181 
GTAAAAGTAGTAGAAGAAAAGGCTTTCAGCCCAGAAGTAATACCCATGTTTTCAGCATTA 

SerGluGlyAlaThrProGlnAspLeuAsnThrMetLeuAsnThrValGlyGlyHlsGln 
868 TCAGAAGGAGCCACCCCACAAGATTTAAACACCATGCTAAACACAGTGGGGGGACATCAA 

AlaAlaMetGlnMetLeuLysGluThrl leAsnGluGluAlaAlaGluTrpAspArgVal 221 
GCAGCCATGCAAATGTTAAAAGAGACTATCAATGAGGAAGCTGCAGAATGGGATAGAGTG 

HisProValHisAlaGlyProI leAlaProGlyGlnMetArgGluProArgGlySerAsp 
988 CATCCAGTGCATGCAGGGCCTATTGCACCAGGCCAAATGAGAGAACCAAGGGGAAGTGAC 

I leAlaGlyThrThrSerThrLeuGlnGluGlnl leGlyTrpMetThrAsnAsnProPro 261 
ATAGCAGGAACTACTAGTACCCTTCAGGAACAAATAGGATGGATGACAAATAATCCACCT 

I leProValGlyGluI leTyrLysArgTrpI lei leLeuGlyLeuAsnLysI leValArg 
1108 ATCCCAGTAGGAGAAATCTATAAAAGATGGATAATCCTGGGATTAAATAAAATAGTAAGA 

MetTyrSerProThrSerl leLeuAspI leArgGlnGlyProLysGluProPheArgAsp 301 
ATGTATAGCCCTACCAGCATTCTGGACATAAGACAAGGACCAAAGGAACCCTTTAGAGAT 

TyrValAspArgPheTyrLysThrLeuArgAlaGluGlnAlaSerGlnAspValLysAsn 
1228 TATGTAGACCGGTTCTATAAAACTCTAAGAGCCGAACAAGCTTCACAGGATGTAAAAAAT 

TrpMetThrGluThrLeuLeuValGlnAsnAlaAsnProAspCysLysThrl leLeuLys 341 
TGGATGACAGAAACCTTGTTGGTCCAAAATGCAAACCCAGATTGTAAGACTATTTTAAAA 

n AlaLeuGlyProAlaAlaThrLeuGluGluMetMetThrAlaCysGlnGlyValGlyGly 
1348 GCATTGGGACCAGCAGCTACACTAGAAGAAATGATGACAGCATGTCAGGGAGTGGGGGGA 

ProGlyHisLysAlaArgValLeuAlaGluAlaMetSerGlnValThrAsnProAlaAsn 381 
CCCGGCCATAAAGCAAGAGTTTTGGCTGAAGCCATGAGCCAAGTAACAAATCCAGCTAAC 

, I leMetMetGlnArgGlyAsnPheArgAsnGlnArgLysThrValLvsCvsPhpAsnf v<? 
1^68 ATAATGATGCAGAGAGGCAATTTTAGGAACCAAAGA AAGACTGTTAAG 

^KU?5Iy5I^Hi SIleAlal -ysAsnCysArgAlaProArgLysLysGlyCysTrDAra 4?i 
GGCAAAGAAGGGCACATAGCCAAAAATTGCAGGGCCCCTAGGAAAAAGGGCTG^ 

CysGlyArgGluGlyHlsGlnMetLysAspCysThrGluArgGlnAlaAsnPheLeuGly 

1588 TGTGGAAGGGAAGGACACCAAATGAAAGATTGCACTGAGAGACAGGCTAATTTTTTAGGG 

i^iifIr^ r ^ e r Ty ^ l r y ?9 lyAr ^ roG1 y AsnPh eLeuGlnSerArgProGluPro 461 
1 uAspLeuA 1 aPheLeuG 1 nG 1 yLys A aA rgG 1 uPheSerSe rG 1 uG 1 nThr AraAla 23 

AAGATCTGGCCTTCCTACAAGGGAAGGCCAGGGAATTTTCnCAGAGCAGACCA^ 

ThrAlaProProGluGluSerPheArgPheGlyGluGluLysThrThrProSerGlnLvs 
_ AsnSerProThrArgArgGluLeuGlnValTrpGlyGlyGluAsnA«?nSpri PnSprninfl 

1708 acagccccaccagaagagagcttcaggtUggggaggagaaa^ 

firnly?^ 1 1 ? As R l r y ? G1 y!: e , u TyrProLeuThrSerLeuArgSerLeuPheGlyAsn 501 
laGlyAlaAspArgGlnGlyThrVa SerPheAsnPhpPrnGini lPThri p Trnrin 

CAGGAGCCGATAGACAAGGAACTGTATCCTTTAACTTCCC^^ 
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AspProSerSerGlnOC 

ArgProLeuValThrl leArgl leGlyGlyGlnLeuLysGluA IaLeuLeuAspThrGlyA 
1828 GACCCCTCGTCACAATAAGGATAGGGGGGCAACTAAAGGAAGCTCTATTAGATACAGGAG 

1 aAspAspThrVa 1 LeuG 1 uG 1 uMetAsnLeuProG lyLysTrpLysProLysMet He 103 
CAGATGATACAGTATTAGAAGAAATGAATTTGCCAGGAAAATGGAAACCAAAAATGATAG 

in , 0 G 1 yG 1 y 1 1 eG 1 yG 1 y Phe 1 1 eLy sVa 1 A rgG 1 nTy r A spG 1 n I ieProValGluI leCysG 
1948 GGGGAATTGGAGGTTTTATCAAAGTAAGACAGTACGATCAGATACCTGTAGAAATCTGTG 

lyHIsLysAlalleGlyThrValLeuValGlyProThrProValAsnllelleGlyArg 143 
GACATAAAGCTATAGGTACAGTATTAGTAGGACCTACACCTGTCAACATAATTGGAAGAA 

0 „^„AsnLeuLeuThrGlnIleGlyCysThrLeuAsnPheProIleSerProIleGluThrValP 
2068 ATCTGTTGACTCAGATTGGTTGTACTTTAAATTTCCCCATTAGTCCTATTGAAACTGTAC 

ToValLysLeuLysProGlyMetAspGlyProLysValLysGlnTrpProLeuThrGlu 183 
CAGTAAAATTAAAGCCAGGAATGGATGGCCCAAAAGTTAAGCAATGGCCATTGACA^ 

„ G lyU§U?LysAlaLeuValGluIleCysThrGluMetGluLysGluGlyLysI leSerL 
2188 AAAAAATAAAAGCATTAGTAGAGATATGTACAGAAATGGAAAAGGAAGGGAAAATTTCAA 

ysl leGlyProG 1 uAsnProTy rAsnThrProVa 1 PheA la I leLysLysLysAspSer 223 
AAATTGGGCCTGAAAATCCATACAATACTCCAGTATTTGCTATAAA^ 

^,«« T Dt!rV§l!;P Ar 9 L y s, - euV alAspPheArgGluLeuAsnLysArgThrGlnAspPheTrpG 
2308 CTAAATGGAGAAAACTAGTAGATTTCAGAGAACTTAATAAAAGAACTCAAGAC^ 

AAGTTCAGnAGGAWACCA^ 
278 8 T ®gW^^ 

2908 G ASAifTXfgcfi G ^ifkX^fAkKg8A8Vf^ 

FIG. 9B l 
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I leLeuLysGluProValHisGluValTyrTyrAspProSerLysAspLeuValAlaGluI 
3028 TTCTAAAAGAACCAGTACATGAAGTATATTATGACCCATCAAAAGACTTAGTAGCAGAAA 

leGlnLysGlnGlyGlnGlyGlnTrpThrTyrGlnlle 503 
TACAGAAGCAGGGGCAAGGCCAATGGACATATCAAATTTATCAAGAGCCATTTAAAAATC 



LeuLysThrGlyLysTyrAlaArgMetArgGlyAlaHisThrAsnAspValLysGlnLeuT 
3148 TGAAAACAGGAAAGTATGCAAGGATGAGGGGTGCCCACACTAATGATGTAAAACA 

3] 
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Ju^liiMaVai^LuLysValSerThrGluSerlleVallleTrpGlyLysIleProLys 5t*3 
I ptac 5 



ARV GAG pl6 - synthetic Parts A and B 

5' arv 234 3/ 

MetGlnArgGlyAsnPheArgWsnGlnArgLysThrValLysCysPheAsnCysGlyLys 
TATTATGCAAAGAGGTAACTTCAGGfeATCAAAGAAAGACCGTTAAGTGTTTCAACTGTGGTAAG 
ATAATACGTTTCTCCATTGAAGTCCTTA1GTTTCTTTCTGGCAATTCACAAAGTTGACACCATTC 
3' ~ arv 235 ~ 5' 1 

10 mnll, 23 hinfl, 

GluGlyHlsI leAlaLysAsnCysArgAlaProArgLysLysAlaCysTrpArgCysGly 
63 GAAGGTCACATCGCTAAGAACTGTAGAGCTCCAAGAAAGAAGGCTTGTTGGAGATGTGGT 
CTTCCAGTGTAGCGATTCTTGACATCTCGAGGTTCTTTCTTCCGAACAACCTCTACACCA 

76 ddel, 88 ban2 hgiA hglJll sacl sdul, 89 alul, 

ArgGluGlyHisGlnMetLysAspCysThrGluArgGlnAlaAsnPheLeuGlyLysI le 
123 AGAGAAGGTCACCAAATGAAGGACTGTACCGAAAGACAAGCTAACTTCTTGGGTAAGATC 
TCTCTTCCAGTGGTTTACTTCCTGACATGGCTTTCTGTTCGATTGAAGAACCCATTCTAG 

129 bstE2, 131 hph, 118 rsal, 161 alul, 178 bglll xho2, 179 
sau3a, 

, o, TrpProSe^yrLysG lyArgProG lyAsnPheLeuG InSerA rgProG 1 uProThrA 1 a 
183 TGGCCATCTTACAAGGGTAGACCAGGTAACTTCTTGCAATCCAGACCAGAACCAACCGCT 
ACCGGTAGAATGTTCCCATCTGGTCCATTGAAGAACGTTAGGTCTGGTCTTGGTTGGCGA 

183 ball cfrl hael, 181 haelll, 199 accl, 204 apyl ecorll sc 

Iri i 

ProProGluGluSerPheArgPheGlyGluGluLysThrThrProSerGlnLysGlnGlu 
243 CCACCTGAAGAAAGTTTCAGGn 

GGTGGACTTCTTTCAAAGTCCAAGCCACTTCTTTTCTGGTGGGGTAGAGTTTTCGTTCTT 
249 mboll, 267 hph. 270 mboll. 

_„„ Pro I leAspLysGluLeuTyrProLeuThrSerLeuArgSerLeuPheGlvAsnAsDPrn 

GGTTAGCTGTTCCTTAACATGGGTAACTGGAGAAACTCTAGGAACAAGCCATTGCTAGGG 

3 30 6 ( Sfl: I 2 62° SSSl'xgL"" 11 ' 339 xh °*' 3 "° sau3a ' 357 ~ sau3a - 

SerSerGlnOP AM 
363 TCGAGCCAATGATAG 

AGCTCGGTTACTATCAGCT 

363 taql, 377 accl hindll sail 

FIG. 10 
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MetSer 

PYK Promoter ATGTCT 



Arg I leAspCysSerA laThrG luLysLeuTrpVa IThrVa lTy rTy rG 1 yVa 1 ProVa 1 51 
AGAATCGAT GTAGTGCTACAGAAAAATTGTGGGTCACAGTTTATTATGGAGTACCTGTG 

TrpLysGluAlaThrThrThrLeuPheCysAlaSerAspAlaArgAlaTyrAspThrGlu 
5908 TGGAAAGAAGCAACTACCACTCTATTTTGTGCATCAGATGCTAGAGCATATGATACAGAG 

ValHisAsnValTrpAlaThrHisAlaCysValProThrAspProAsnProGlnGluVal 91 
GTACATAATGTTTGGGCCACACATGCCTGTGTACCCACAGACCCCAACCCACAAGAAGTA 

ValLeuGlyAsnValThrGluAsnPheAsnMetTrpLysAsnAsnMetValGluGlnMet 

6028 GTATTGGGAAATGTGACAGAAAATTTTAACATGTGGAAAAATAACATGGTAGAACAGATG 

GlnGluAspI lei leSerLeuTrpAspGlnSerLeuLysProCysValLysLeuThrPro 131 
CAGGAGGATATAATCAGTTTATGGGATCAAAGCCTAAAGCCATGTGTAAAATTAACCCCA 

rn , „ Ley^§y§iTDC LeuAsnC y sThrAs P Leu61 y L ysAlaThrAsnThrAsnSerSerAsn 
6148 CTCTGTGTTACTTTAAATTGCACTGATTTGGGGAAGGCTACTAATACCAATAGTAGTAAT 

I^Pt^?51y§iyi leLysGlyGluI leLysAsnCysSerPheAsnl leThrThrSerl le 171 
TGGAAAGAAGAAATAAAAGGAGAAATAAAAAACTGCTCTTTCAATATCACCACAAGCATA 



6388 



Ii?I^5ipAlaCysProLysValSerPheGIuProI leProI leHIsTyrCysThrPro 
ATTACACAGGCCTGTCCAAAGGTATCATTTGAGCCAATTCCCATACATTATTGTACCCCG 

AlaGlyPheAlalleLeuLy^ 251 
GCTGGTTTTGCGATTCTAAAGTGTAATAATAAAACGTTCAATGGAAAAGGACCATGTACA 

FIG. 11A 
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AsnValSerThrValGlnCysThrHisGlyl leArgProI leValSerThrGlnLeuLeu 
6508 AATGTCAGCACAGTACAATGTACACATGGAATTAGGCCAATAGTG^ 

LeuAsnGlySerLeuAlaGluGluGluValVallleArgSerAspAsnPheThrAsnAsn 291 

ttaaatggcagtctagcagaagaagaggtagtaattagatctgacaatttcacgaacaS 

AlaLysThrl lei leValGlnLeuAsnGluSerValAlal leAsnCvsThrAraPrnA^n 
6628 GCTAAAACCATAATAGTACAGCTGAATGAATCTGTAGCAATTA^ 

AsnAsnThrArgLysSerl leTyrl leGlyProGlyArgAlaPheHisThrThrGlvAra 331 
AACAATACAAGAAAAAGTATCTATATAGGACCAGGGAGAGCATTTCATACAAC^ 

67,8 MlSM 

TT AGAACAGATAgIIa^ ™ 

™™ AsnGlnSerSerGlyGlyAspProGluI leValMetHlsSerPheAsnCvsAraGl vGin 
6868 AATCAATCCTCAGGAGGGGACCCAGAAATTGTAATGCACAGTnT^ 

m?MMm n !:f!^»jK}^W«I»1J[j( " 1 E 

7 108 f?A r 5l9ijfA T eA r |G^f^?^f^ T fA^^ r ^A S f§G^ 6 Y^A r SA S ¥^iA T C h «l9gl8J0c r 
G 1 uVa 1 Phe A rgProG 1 yG 1 yG 1 yAspMetArgAspAsnTrpArgSerGluLeuTvri vs 

gaggtcttcagacctggaggaggagatatgagggacaattggagaagtgaaHata^ 
7228 tataaagtaataaaaattgaaccaa attcggtatcttga 



,91 V 



PYK Terminator 
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Nucleotide Met I leAspLysAlaGlnGluGluHisGluLys.TyrHisSerAsnTrp 

pos i t i ons 1 AGGXAACAG : : : : ATGAT : GA : AAGGCACAAGAAGAACATGAGAAATATCACAG TAATTGG 
relative to TCCXTTGTC: : : jTACTAjCTiTTCCGTGTTCTTCTTGTACTCTTTATAGTGTGATTAACC 



Figure 5. 



32 mboll, 38 nlalll, 



ArgAlaMetAlaSerAspPheAsnLeuProProValValAlaLysGluI leValAlaSer 
3820 62 AGAGCCATGGCTAGTGATTTTAACCTGCCACCTGTAGTAGCAAAAGAAATAGTAGCCAGC 

TCTCGGTACCGATCACTAAAATTGGACGGTGGACATCATCGTTTTCTTTATCATCGGTCG 

66 ncol, 67 nlalll, 118 nspBII pvull, 119 alul, 

CysAspLysCysGlnLeuLysGlyGluAlaMetHisGlyGlnValAspCysSerProGly 
3880 122 TGTGATAAATGTCAGCTAAAAGGAGAAGCCATGCATGGACAAGTAGACTGTAGTCCAGGA 

ACACTATTTACAGTCGATTTTCCTCTTCGGTACGTACCTGTTCATCTGACATCAGGTCCT 

135 alul, 151 nlalll, 152 nsll ava3, 155 nlalll, 164 accl, 1 
76 apyl bstXI ecorll scrFl, 

I leTrpGlnLeuAspCysThrHisLeuGluGlyLysI lei leLeuValAlaValHisVal 
3940 182 ATATGGCAACTAGATTGTACACATCTAGAAGGAAAAATTATCCTGGTAGCAGTTCATGTA 

TATACCGTTGATCTAACATGTGTAGATCTTCCTTTTTAATAGGACCATCGTCAAGTACAT 

198 rsal, 205 xbal, 223 apyl ecorll scrFl, 236 nlalll, 

AlaSerGlyTyrl leGluAlaGluVal I leProAlaGluThrGlyGlnGluThrAlaTyr 
4000 242 GCCAGTGGATATATAGAAGCAGAAGTTATTCCAGCAGAGACAGGGCAGGAAACAGCATAT 

CGGTCACCTATATATCTTCGTCTTCAATAAGGTCGTCTCTGTCCCGTCCTTTGTCGTATA 

263 xmnl, 

,™ EDlUyUV t v § LeuA laG lyArgTrpProVa 1 LysThr 1 1 eH i slhrAspAsnG 1 ySe r 
4060 302 TTTCTCTTAAAATTAGCAGGAAGATGGCCAGTAAAAACAATACATACAGACAATGGCAGC 

AAAGAGAATTTTAATCGTCCTTCTACCGGTCATTTTTGTTATGTATGTCTGTTACCGTCG 

321 mboll, 326 ball cfrl hael, 327 haelll, 357 bbv fnu4lil, 

AsnP.heThrSerThrThrValLysAlaAlaCysTrpTrpAlaGlyl leLysGlnGluPhe 
4120 362 AATTTCACCAGTACTACGGT 

TTAAAGTGGTCATGATGCCAATTCCGGCGGACAACCACCCGTCCCTAGTTCGTCCTTAAA 

366 hph, 371 seal. 372 rsal, 385 haelll, 386 fnu4hl nsbll, 4 
05 blnl, 406 donl sau3a, 

/.ion - ,™ Gly. Ilepr oTyrAsnProGlnSerGlnGlyValValGluSerMetAsnAsnGluLeuLys 
4180 422 GGCATTCCCTACAATCCa 

CCGTAAGGGATGTTAGGGGTTTCAGTTCCTCATCATCTTAGATACTTATTACTTAATTTC 
423 bsml, 458 hinfl, 

inim t ,on !r iff i I? 14-f 5Jly5 1 PX?i ^ rgA S P" G 1 n A 1 aG 1 uH i sLeuLysTh r A 1 aVa 1 G 1 nMe t A 1 a 

4240 482 AAAATTATAGGACAGGTAAGAG^ 

TTTTAATATCCTGTCCATTCTCTAGTCCGACTTGTGGAATTCTGTCGTCATGTTTACCGT 
503 dpnl sau3a, 518 af 111. 530 rsal, 

ilWl Klin 

4300 542 pTATTCATCCACAATTTTAAAAGAAA^ 

CATAAGTAGGTGTTAAAATTTTCTTTTCCCCCCTAACCCCCTATGTCACGTCCCCTTTCT 

547 fokl, 557 ahalll, 

FIG. 12A 
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™„ I leVal Aspl lei leA laThrAspI leGlnThrLysGluLeuGlnLysGlnl leThrLys 
4360 602 ATAGTAGACATAATAGCAACAGACATACAAACTAAAGAACTACAAAAGCAAATTACAAAA 

TATCATCTGTATTATCGTTGTCTGTATGTTTGATTTCTTGATGTTTTCGTTTAATGTTTT 

605 accl, 

Ii?GlnAsnPheArgValTyrTyrArgAspAsnLysAspProLeuTrpLysGlyProAla 
4420 662 ATTCAAAATTTTCGGGTTTATTACAGGGACAACAAAGATCCCCTTTGGAAAGGACCAGCA 

TAAGTTTTAAAAGCCCAAATAATGTCCCTGTTGTTTCTAGGGGAAACCTTTCCTGGTCGT 

697 xho2, 698 dpnl sau3a, 713 asul ava2, 

_ LysLeuLeuTrpLysGlyGluGlyAlaValVal I leGlnAspAsnSerAspI leLysVai 
4480 722 AAGCTTCTCJGGAAAGGTGAAGGGG 

TTCGAAGAGACCTTTCCACTTCCCCGTCATCATTATGTTCTATTATCACTGTATTTTCAT 
722 hindlll, 723 alul, 737 hph, 

z.c.n ValProArgArgLysAlaLysI lei leArgAspTyrGlyLysGlnMetAlaGlyAsuAso 

4540 782 GTGCCAAGAAGAAAA^ 

CACGGTTCTTCTTTTCGTTTTTAGTAATCCCTAATACCTTTTGTCTACCGTCCACTACTA 
789 mboll, 833 hph, 

i,cnn o,o CysValAlaSerArgGlnAspGluAspAM 

85 s7fl C £jAr? 59 fok1 ' 863 mnl1 ' 871 accl hindll sail, 872 taal 



FIG. 12B 



931154 



ro 



CD m j 



CO 

O 

CD 
OJ 
(T 

O 
CD 
O 
X 
CL 
O 

cr 

Q 

>- 

X 



/ 

/ 

/ 

/,! 

// 
// 
// 



■ o 

JZ 

X 



< 



UJO 
I 

O 



o 

CO 
Q. 



o 
o 



UJ 



CD 
< 



CL 



y 



I 

o 
x 



i 

-o 

< 



CO 
GO 



ro 
i 

> 
c 



If) 
I 

> 
C 
CD 



CP 

o 

7>% 



CO 
I 

> 
c 

CD 



I 

> 

c 

CD 



1§ 

UJ Q 

cr z: 
a < 
x 

UJ 

> 
c 



> 
c 

CD 



f] 931154 



BSIED769 




Bgl 15545 



FIG. 14 



01 931154 



242 



SOD 

MetAlaThrLysAlaValCysValLeuLysGlyAspGlyProValGlnGlyllelleAsn 
1 CATGGCGACGAAGGCCGTGTGCGTGCTGAAGGGCGACGGCCCAGTGCAGGGCATCATCAAT 
CGCTGCTTCCGGCACACGCACGACTTCCCGCTGCCGGGTCACGTCCCGTAGTAGTTA 

PheGluGlnLysGluSerAsnGlyProValLysValTrpGlySerlleLysGlyLeuThr 
62 TTCGAGCAGAAGGAAAGTAATGGACCAGTGAAGGTGTGGGGAAGCATTAAAGGACTGACT 
AAGCTCGTCTTCCTTTCATTACCTGGTCACTTCCACACCCCTTCGTAATTTCCTGACTGA 

GluGlyLeuHisGlyPheHlsValHisGluPheGlyAspAsnThrAlaGlyCysThrSer 
122 GAAGGCCTGCATGGATTCCATGTTCATGAGTTTGGAGATAATACAGCAGGCTGTACCAGT 
CTTCCGGACGTACCTAAGGTACAAGTACTCAAACCTCTATTATGTCGTCCGACATGGTCA 

AlaGlyProHlsPheAsnProLeuSerArgLysHisGlyGlyProlysAspGluGl uArg 
182 GCAGGTCCTCACTTTAATCCTCTATCCAGAAAACACGGTGGGCCAAAGGATGAAGAGAGG 
CGTCCAGGAGTGAAATTAGGAGATAGGTCTTTTGTGCCACCCGGTTTCCTACTTCTCTCC 

HisValGlyAspLeyGlyAsnValThrAlaAspLysAspGlyVal^ 
CATGTTGGAGACTTGGGCAATGTGACTGCTGACAAAGATGGTGTGGCCGATGTGTCTATT 
GTACAACCTCTGAACCCGTTACACTGACGACTGTTTCTACCACACCGGCTACACAGATAA 

_ no Gl uAspSerVa 1 1 leSerLeuSerG lyAspH isCys I lei leG lyArgThrLeuVa lVa 1 
302 GAA6ATTCTGTGATCTCACTCTCAGGAG 

CTTCTAAGACACTAGAGTGAGAGTCCTCTGGTAACGTAGTAACCGGCGTGTGACCACCAG 

tco ^l^ly^f^^5S^RUy9I^ ysG1 y G1 y AsnGluG luSerThrLysThrGlyAsn 
362 CATGAAAAAGCAGATGACT^ 

GTACTTTTTCGTCTACTGAACCCGTTTCCACCTTTACTTCTTTCATGTTTCTGTCCTTTG 



422 
482 
542 
602 



722 
782 



CGACCTTCAGCAAACCGAAC^ 

GTGAACGACGTCAACTGGCAAACCCCATAGTTCGTCAACGTCCGATCTCAAAACCGACAA 

^iy A ^?J y ^Sy ArgAspGlnGlnLe uLeuGlyI leTrpGlyCysSerGlyLysLeuI le 
GAAAGA I A CTTGAGAGATCAACAATTGTTGGGTATCT^ 
CTTTCTATGAACTCTCTAGTTGTTAACAACCCATAGACCCCAACAAGAa 

CysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSerLeuGluAqoi lpTm 
TGTACCACCGCTGTTCCCTGGAACGCTTCTTGGTCTAACAA^ 
ACATGGTGGCGACAAGGGACCTTGCGAAGAACCAGATTGTTCAGAAACCn 

ThrLeuLeuGluGluSerGlnAsnGlnGlnGluLysAsnGluGlnGluLeuLeuGIuipij 

TGGAACAACCTCCTTAGAGTTTTGGTTGTTCTTTTCTTGCTTGTTCTTAACAACCTTAAC 
A spLysTrpAlaSerLeuTrpAsnTrpPheSerIleThrAsnTrpAM 

Translated Mol. Weight = 30414.22 

FIG. 15 
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Ncol @) ^ S< j rFI 

C ATG CCT ATA GTG CAG AAT C TG CAll tsvnthetic DIMa|( arv-P niMA^ Ei 

| GGA TAT CAC GTC TTA Cli 1^ -ft! 



GGG CAT AAA GCA AGA GTT TTG TGA TAGL_— . , 
r.r.r. rtattt cgt tot p.aa aac act atc AGCTl 
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■gel purification 
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'/, cut 
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C 


1 

Met 
ATG 


Pro 
CCT 


He 

ATA 

ATA 


Val 
GTG 


Gin 
CAG 


Asn 
AAT 


Leu 
CTG 


Gin 
CAG 


Gly 
GGG 


10 
Gin 
CAA 


Met 
ATG 


Val 
GTA 


His 
CAT 


Gin 
CAG 


Ala 

err 


He 
ATA 


Ser 
TCA 


Pro 

LL 1 


Arg 
AGA 


20 
Thr 
ACT 


Leu 

TT A 
1 1 A 


Asn 

AAT 


Ala 
GCT 


Trp Val 
TGG GTA 


Lys 
AAA 


Val 
GTA 


Val 
GTA 


Glu 
GAA 


30 
Glu 
GAA 


Lys 
AAG 


Ala 
GCT 


Phe 
TTC 


Ser 
AGC 


Pro 
CCA 


Glu 
GAA 


Val 

GTA 


He 
ATA 


Pro 
CCC 


i\0 
Met 
ATG 


Phe 
TTT 


Ser 
TCA 


Ala 
GCA 


Leu 
TTA 


Ser 
TCA 


Glu 
GAA 


Gly 
GGA 


Ala 
GCC 


Thr 
ACC 


50 
Pro 
CCT 


Gin 
CAA 


Asp 
GAT 


Leu 
TTA 


Asn 
AAC 


Thr 
ACC 


Met 
ATG 


Leu 
CTA 


Asn 
AAC 


Thr 
ACA 


60 
Val 
GTG 


Gly Gly 
GGG GGA 


His 
CAT 


Gin 
CAA 


Ala 
GCA 


Ala 
GCC 


Met 
ATG 


Gin 
CAA 


Met 
ATG 


70 
Leu 
TTA 


Lys 
AAA 


Glu 
GAG 


Thr 
ACT 


He 
ATC 


Asn 
AAT 


Glu 
GAG 


Glu 
GAG 


Ala 
GCT 


Ala 
GCC 


80 
Glu 
GAA 


Trp 
TGG 


Asp 
GAT 


Arg 
AGA 


Val 
GTG 


His 
CAT 


Pro 
CCA 


Val 
GTG 


His 
CAT 


Ala 
GCA 


90 
Gly 
GGG 


Pro 
CCT 


He 
ATT 


Ala 
GCA 


Pro Gly 
CCA GGC 


Gin 
CAA 


Met 
ATG 


Arg Glu 
AGA GAA 


100 
Pro 
CCA 


Arg Gly Ser 
AGG GGA AGT 


Asp 
GAC 
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110 

lie Ala Gly Thr Thr Ser Thr Leu Gin Glu Gin He Gly Trp Met 
ATA GCA GGA ACT ACT AGT ACC CTT CAG GAA CAA ATA GGA TGG ATG 

120 130 

Thr Asn Asn Pro Pro He Pro Val Gly Glu He Tyr Lys Arg Trp 

ACA AAT AAT CCA CCT ATC CCA GTA GGA GAA ATC TAT AAA AGA TGG 

110 

lie He Leu Gly Leu Asn Lys lie Val Arg Met Tyr Ser Pro Thr 
ATA ATC CTG GGA TTA AAT AAA ATA GTA AGA ATG TAT AGC CCT ACC 

150 160 

Ser He Leu Asp He Arg Gin Gly Pro Lys Glu Pro Phe Arg Asp 

AGC ATT CTG GAC ATA AGA CAA GGA CCA AAG GAA CCC TTT AGA GAT 

170 

Tyr Val Asp Arg Phe Tyr Lys Thr Leu Arg Ala Glu Gin Ala Ser 
TAT GTA GAC CGG TTC TAT AAA ACT CTA AGA GCC GAA CAA GCT TCA 

180 190 

Gin Asp Val Lys Asn Trp Met Thr Glu Thr Leu Leu Val Gin Asn 

CAG GAT GTA AAA AAT TGG ATG ACA GAA ACC TTG TTG GTC CAA AAT 

. 200 

r\\ SfP ££9 TPf II§ Leu Lvs Ala Leu Gly Pro Ala 

GCA AAC CCA GAT TGT AAG ACT ATT TTA AAA GCA TTG GGA CCA GCA 

210 220 

Ala Thr Ley Glu Glu Met Met Thr Ala Cys Gin Gly Val Glv Glv 

GCT ACA CTA GAA GAA ATG ATG ACA GCA TGT CAG GGA GTG GGG GGA 

230 232 
Pro Gly His Lys Ala Arg Val Leu OP 
CC C GGG CAT AAA GCA AGA GTT TTG TGA TAG 

Translated Mol. Weight = 25700.75 

FIG. 22B 
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SOD — 

MetAlaThrLysAla 
ATGGCTACAAAGGCT 
TACCGATGTTTCCGA 

ValCysValLeuLysGlyAspGlyProValGlnGlyl lei leAsnPheGluGlnLysGlu 
1383 GTTTGTGTTTTGAAGGGTGACGGCCCAGTTCAAGGTATTATTAACTTCGAGGAGAAGGAA 
CAAACACAAAACTTCCCACTGCCGGGTCAAGTTCCATAATAATTGAAGCTCGTCTTCCTT 

i , , , SerAsnGlyProValLysValTrpG lySerl leLysG 1 yLeuThrG 1 uG 1 yLeuH i sG 1 y 
m3 AGTAATGGACCAGTGAAGGTGTGGGGAAGCATTAAAGGACTGACTGAAGGCCTGCATGGA 
TCATTACCTGGTCACTTCCACACCCCTTCGTAATTTCCTGACTGACTTCCGGACGTACCT 

PheHisValHisGluPheGlyAs^ 
1503 TTCCATGTTCATGAGTTTGGAGATAATACAGCAGGCTGTACCAGTGCAGGTCCTCACTTT 
AAGGTACAAGTACTCAAACCTCTATTATGTCGTCCGACATGGTCACGTCCAGGAGTGAAA 

, c „ AsnProLeuSerArgLysHlsGlyGlyProLysAspGluGluA 
1563 AATCCTCTATCCAGAAAACACGGTGGGCCAAAGGA 

TTAGGAGATAGGTCTTTTGTGCCACCCGGTTTCCTACTTCTCTCCGTACAACCTCTGAAC 

GlyAsnValThrAlaAspLysAspGlyValAlaAspValSerlleGluAspSerVall le 
1623 GGCAATGTGACTGCTGACAAA^ 

CCGTTACACTGACGACTGTTTCTACCACACCGGCTACACAGATAACTTCTAAGACACTAG 

SerLeuSerGlyAspHisCysI lei leGlyArgThrLeuValValHisGluLvsAlaAsD 
1683 TCACTCTCAGGAGACCATTGCATCATTG^ 

AGTGAGAGTCCTCTGGTAACGTAGTAACCGGCGTGTGACCACCAGGTACTTTTTCGTCTA 



^fRt?y^X!ry?5I^ G Iy AsnGluG luSerThrLysThrGlyAsnAlaGlySerArgLeu 

171,3 8^WgEg^n!^g^5B^g^^^i?n^^8H^^88JgE^8Ig8ljJg 

. A 1 aCy sG 1 yya 1 1 1 eG 1 y 1 1 eA 1 aG 1 n A s nSe rG 1 yVa 1 G 1 y A 1 aMe t A 1 aMe t A 1 aSe r 

1803 ^IMJjf^^^ 

10 ,, ^??ED?AsnLeuProProValValAlaLysGluIleValAlaSerCysAspLysCysGln 
1n „ L?y!rysGlyGluAlaMetHisGlyGlnValAspCysSerProGlyIleTrpGlnLeLiA^ 

FIG. 24A 
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CysThrHisLeuGluGlyLysI lei leLeuValAlaValHisValAlaSerGlyTyrl le 
1983 TGTACACATCTAGAAGGAAAAATTATCCTGGTAGCAGTTCATGTAGCCAGTGGATATATA 
ACATGTGTAGATCTTCCTTTTTAATAGGACCATCGTCAAGTACATCGGTCACCTATATAT 

GluAlaGluVallleProAlaGluThrGlyGlnGluThrAlaTyrPheLeuLeuLysLeu 
2043 GAAGCAGAAGTTATTCCAGCAGAGACAGGGCAGGAAACAGCATATTTTCTCTTAAAATTA 
CTTCGTCTTCAATAAGGTCGTCTCTGTCCCGTCCTTTGTCGTATAAAAGAGAATTTTAAT 

AlaGlyArgTrpProValLysThrlleHisThrAspAsnGlySerAsnPheThrSerThr 
2103 GCAGGAAGATGGCCAGTAAAAACAATACATACAGACAATGGCAGCAATTTCACCAGTACT 
CGTCCTTCTACCGGTCATTTTTGTTATGTATGTCTGTTACCGTCGTTAAAGTGGTCATGA 

ThrValLysAlaAlaCysTrpTrpAlaGIyl leLysGlnGluPheGlyl leProTyrAsn 
2163 ACGGTTAAGGCCGCCTGTTGGTGGGCAGGGATCAAGCAGGAATTTGGCATTCCCTACAAT 
TGCCAATTCCGGCGGACAACCACCCGTCCCTAGTTCGTCCTTAAACCGTAAGGGATGTTA 

ProGlnSerGlnGlyValValGluSerMetAsnAsnGluLeuLysLysI lei leGlyGln 
2223 CCCCAAAGTCAAGGAGTAGTAGAATCTATGAATAATGAATTAAAGAAAATTATAGGACAG 
GGGGTTTCAGTTCCTCATCATCTTAGATACTTATTACTTAATTTCTTTTAATATCCTGTC 

ValArgAspGlnAlaGluHisLeuLysThrAlaValGlnMetAlaValPhelleHisAsn 
2283 GTAAGAGATCAGGCTGAACACCTTAAGACAGCAGTACAAATGGCAGTATTCATCCACAAT 
CATTCTCTAGTCCGACTTGTGGAATTCTGTCGTCATGTTTACCGTCATAAGTAGGTGTTA 

, MeLysArgLysGlyGlyl leGlyGlyTyrSerAlaGlyGluArgl leValAspI lei le 
2343 TTTAAAAGAAAAGGGGGGATTGGGGGATACAGTGCAGGGGAAAGAATAGTAGACATAATA 
AAATTTTCTTTTCCCCCCTAACCCCCTATGTCACGTCCCCTTTCTTATCATCTGTATTAT 

o,n, ^?TD^?RUf9i^TDtU^lyU uGlnL y s61nIleThrL ysIleGlnAsnPheArg 
2403 GCAACAGACATACAAACTAAAGAACTACAAAAGCAAATTACAAAAATTCAAAATTTTCGG 
CGTTGTCTGTATGTTTGATTTCTTGATGTTTTCGTTTAATGTTTTTAAGTTTTAAAAGCC 

yalTyrTyrArgAspAsnLysAspProLeuTrpLysGlyProAlaLysLeuLeuTrpLys 
2 ^ 53 GniAI]ACAGGGACAACAAAGATCCCCTTTGGA 

CAAATAATGTCCCTGTTGTTTCTAGGGGAAACCTTTCCTGGTCGTTTCGAAGAGACCTTT 

oco, GlyGluGlyAlaValVallleGlnAspAsnSerAspI leLysValValProArgArgLys 
2523 GGTGAAGGGGCAGTAGTAATA^ 

CCACTTCCCCGTCATCATTATGTTCTATTATCACTGTATTTTCATCACGGTTCTTCTTTT 



2583 



^ft^I^I-if^^RJyr^yLysG^MetAlaGlyAspAspCysValAlaSerArg 
GCAAAAATCATTAGGGATTAT^ 

CGTTTTTAGTAATCCCTAATACCTTTTGTCTACCGTCCACTACTAACACACCGTTCATCT 



GlnAspGluAspAM 
2643 CAGGATGAGGATTAG 
GTCCTACTCCTAATC 
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Sequence of SOD/env-4 
SOD 

MetAlaThrLysAlaValCysValLeuLysGlyAspGlyProValGlnGlyl lei leAsn 
1 CATGGCGACGAAGGCCGTGTGCGTGCTGAAGGGCGACGGCCCAGTGCAGGGCATCATCAAT 
CGCTGCTTCCGGCACACGCACGACTTCCCGCTGCCGGGTCACGTCCCGTAGTAGTTA 

PheGluGlnLysGluSerAsnGlyProValLysValTrpGlySerl leLysGlyLeuThr 
62 TTCGAGCAGAAGGAAAGTAATGGACCAGTGAAGGTGTGGGGAAGCATTAAAGGACTGACT 
AAGCTCGTCTTCCTTTCATTACCTGGTCACTTCCACACCCCTTCGTAATTTCCTGACTGA 

GluGlyLeuHisGlyPheHisValHisGluPheGlyAspAsnThrAlaGlyCysThrSer 
122 GAAGGCCTGCATGGATTCCATGTTCATGAGTTTGGAGATAATACAGCAGGCTGTACCAGT 
CTTCCGGACGTACCTAAGGTACAAGTACTCAAACCTCTATTATGTCGTCCGACATGGTCA 

AlaGlyProHisPheAsnProLeuSerArgLysHisGlyGlyProLysAspGluGluArg 
182 GCAGGTCCTCACTTTAATCCTCTATCCAGAAAACACGGTGGGCCAAAGGATGAAGAGAGG 
CGTCCAGGAGTGAAATTAGGAGATAGGTCTTTTGTGCCACCCGGTTTCCTACTTCTCTCC 

HisValGlyAspLeuGlyAsnValThrAlaAspLysAspGlyValAlaAspValSerlle 
212 CATGTTGGAGACTTGGGCAATGTGACTGCTGACAAAGATGGTGTGGCCGATGTGTCTATT 
GTACAACCTCTGAACCCGTTACACTGACGACTGTTTCTACCACACCGGCTACACAGATAA 

_ GluAspSerVallleSerLeuSerGlyAspHisCysI lei leGlyArgThrLeuValVal 
302 GAAGATTCTGTGATCTCACTCTCAGGAGACCATTGCATCATTGGCCGa 

CTTCTAAGACACTAGAGTGAGAGTCCTCTGGTAACGTAGTAACCGGCGTGTGACCACCAG 

„ n HisGluLysAlaAspAspLeuGlyLysGlyGlyAsnGluGluSerThrLysThrGlyAsn 
362 CATGAAAAAGCAGATGACTTGGG^ 

GTACTTTTTCGTCTACTGAACCCGTTTCCACCTTTACTTCTTTCATGTTTCTGTCCTTTG 

Envl--^ 

, „ AlaGlySerArgLeuAlaCysGIyVal IleGlhl leAlaMetGluValVall leAraSpr 
H22 GCTGGAAGTCGTTTGGCTTGTGGTGTAATT 

CGACCTTCAGCAAACCGAACACCACATTAACCCTAGCGGTACCTCCATCATTAATCTAGA 

, no AspAsnPheThrAsnAsnAlaLysThrllelleValGlnLeuAsnGluSerValAlalle 
™ 2 ^f??IIIHCGAACAATGCTAAAACCATAAT 

CTGTTAAAGTGCTTGTTACGATTTTGGTATTATCATGTCGACTTACTTAGACATCGTTAA 
c/,o A?P?XIIDr A f?^R A ? nAsnAsnTnrAr 9 L ysSerI leTyrl leGlyProGlyArgAla 
TTGACATGTTCTGGGTTGTTGTTATGTTCTTTTTCATAGATATATCCTGGTCCCTCTCGT 

FIG. 26A 
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PheHisThrThrGlyArgllel leGlyAspI leArgLysAlaHisCysAsnl leSerArg 
602 TTTCATACAACAGGAAGAATAATAGGAGATATAAGAAAAGCACATTGTAACATTAGTAGA 
AAAGTATGTTGTCCTTCTTATTATCCTCTATATTCTTTTCGTGTAACATTGTAATCATCT 

AlaGlnTrpAsnAsnThrLeuGluGlnl leValLysLysLeuArgGluGlnPheGlyAsn 
662 GCACAATGGAATAACACTTTAGAACAGATAGTTAAAAAATTAAGAGAACAGTTTGGGAAT 
CGTGTTACCTTATTGTGAAATCTTGTCTATCAATTTTTTAATTCTCTTGTCAAACCCTTA 

_ AsnLysThrl leValPheAsnGlnSerSerGlyGlyAspProGluI leValMetHisSer 
722 AATAAAACAATAGTCTTTAATCAATCCTCAGGAGGGGACCCAGAAATTGTAATGCACAGT 
TTATTTTGTTATCAGAAATTAGTTAGGAGTCCTCCCCTGGGTCTTTAACATTACGTGTCA 

Ph? A fQCysArgGlyGluPhePh^ 
782 TTTAATTGTAGAGGGGAATTTTTCTACTGTAATACAACACAACTGTTTAATAATACATGG 
AAATTAACATCTCCCCTTAAAAAGATGACATTATGTTGTGTTGACAAATTATTATGTACC 

„,„ ArgLeuAsnHisThrGluGlyThrLysGlyAsnAspThrllelleLeuProCysArglle 
842 AGGTTAAATCACACTGAAGGAACTAAAGGAAATGACACAATCATACTCCCATGTAGAATA 
TCCAATTTAGTGTGACTTCCTTGATTTCCTTTACTGTGTTAGTATGAGGGTACATCTTAT 

nnn LysGlnl lei leAsnMetTrpGlnGluValGiyLysAlaMetTyrAlaProProI leGly 
902 AAACAAATTATAAACATGTGGC^ 

TTTGTTTAATATTTGTACACCGTCCTTCATCCTTTTCGTTACATACGGGGAGGGTAACCT 

oco ^^^U?^X?l?r§?!: A ?Oi leThrG1 y LeuLeuL euThrArgAspGlyGlyThr 
962 GGACAAATTAGTTGTTCATCAAATATTACAGGGCTGCTATTAACAAGAGATGGTGGTACA 
CCTGTTTAATCAACAAGTAGTTTATAATGTCCCGACGATAATTGTTCTCTACCACCATGT 

, „ AsnVa IThrAsnAspThrG 1 uVa 1 PheArgProG 1 yGl yG 1 vAsDMpr AraA<;r>A<;nTrn 

1022 WiaWfl^JGACACCGAGGTCT 

TTACATTGATTACTGTGGCTCCAGAAGTCTGGACCTCCTCCTCTATACTCCCTGTTAACC 

FIG. 26B 
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